Identifying “Soft 404” error pages: analyzing the lexical signatures of documents in distributed collections
Collections of Web-based resources are often decentralized; leaving the task of identifying
and locating removed resources to collection managers who must rely on http response …
and locating removed resources to collection managers who must rely on http response …
[HTML][HTML] Moved but not gone: an evaluation of real-time methods for discovering replacement web pages
Abstract Inaccessible Web pages and 404 “Page Not Found” responses are a common Web
phenomenon and a detriment to the user's browsing experience. The rediscovery of missing …
phenomenon and a detriment to the user's browsing experience. The rediscovery of missing …
Open annotations on multimedia Web resources
Many Web portals allow users to associate additional information with existing multimedia
resources such as images, audio, and video. However, these portals are usually closed …
resources such as images, audio, and video. However, these portals are usually closed …
Detecting off-topic pages within TimeMaps in Web archives
Web archives have become a significant repository of our recent history and cultural
heritage. Archival integrity and accuracy is a precondition for future cultural research …
heritage. Archival integrity and accuracy is a precondition for future cultural research …
Reviving Dead Links on the Web with Fable
J Zhu, A Nyayachavadi, J Zhu… - Proceedings of the …, 2023 - dl.acm.org
The web is littered with millions of links which previously worked but no longer do. When
users encounter any such broken link, they resort to looking up an archived copy of the …
users encounter any such broken link, they resort to looking up an archived copy of the …
Shelf life: Identifying the abandonment of online digital humanities projects
L Meneses, R Furuta - Digital Scholarship in the Humanities, 2019 - academic.oup.com
A large portion of the research carried out in the digital humanities has an online digital
object (usually referred as a project) as one of its components. In turn, these online digital …
object (usually referred as a project) as one of its components. In turn, these online digital …
[图书][B] Using web archives to enrich the live web experience through storytelling
Y AlNoamany - 2016 - search.proquest.com
Much of our cultural discourse occurs primarily on the Web. Thus, Web preservation is a
fundamental precondition for multiple disciplines. Archiving Web pages into themed …
fundamental precondition for multiple disciplines. Archiving Web pages into themed …
[图书][B] Web archive services framework for tighter integration between the past and present web
A AlSum - 2014 - search.proquest.com
Web archives have contained the cultural history of the web for many years, but they still
have a limited capability for access. Most of the web archiving research has focused on …
have a limited capability for access. Most of the web archiving research has focused on …
Reading the correct history? Modeling temporal intention in resource sharing
HM SalahEldeen, ML Nelson - Proceedings of the 13th ACM/IEEE-CS …, 2013 - dl.acm.org
The web is trapped in the" perpetual now", and when users traverse from page to page, they
are seeing the state of the web resource (ie, the page) as it exists at the time of the click and …
are seeing the state of the web resource (ie, the page) as it exists at the time of the click and …
Find, new, copy, web, page-tagging for the (re-) discovery of web pages
Abstract The World Wide Web has a very dynamic character with resources constantly
disappearing and (re-) surfacing. A ubiquitous result is the “404 Page not Found” error as …
disappearing and (re-) surfacing. A ubiquitous result is the “404 Page not Found” error as …