Interesting administrative position for anyone worldwide with
* Familiarity with library, archive, or museum collections and practices
* Knowledge of web archiving
* Experience of work within a research institution or library
#webarchiving #archive #library #jobannouncement
On 24 August, everything at the @DNB_Aktuelles in Frankfurt am Main revolved around web archiving. For everyone who could not be there, and for those who would like to read up again: the slides from the talks are now online!
📆This #WebArchiveWednesday marks just over one month before the CfP for #iipcWAC24 closes. Start thinking about your submission today!
📢CfP: #WebArchives in Context📢
netpreserve.org/ga2024/cfp/
🟣PROPOSALS DUE SEPT 24
🟣#iipcWAC24 24-26 APR 2024
🇫🇷NATIONAL LIBRARY OF FRANCE (BnF), PARIS, FRANCE
🖥 #WebArchiving 🎉#iipc20years
💾#DigitalPreservation 📖#DigitalHumanities
What about the metadata that is present? Grusky et al. (https://doi.org/10.18653/v1/N18-1065) realized that, because page authors create that metadata, it can serve as ground truth to evaluate #Automatic #Summarization.
We analyzed pages from #WebArchiving and saw how this metadata evolved. By 2010 we saw a metadata explosion with the use of #Twitter Cards, Open Graph Protocol, #Facebook Tracking, and more. Things like Twitter cards created a metadata renaissance for HTML.
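To make the kind of metadata concrete, here is a minimal sketch (not the tooling used in the study) that pulls Open Graph and Twitter Card fields out of a page with Python's standard library. It assumes the usual conventions: Open Graph in <meta property="og:..."> and Twitter Cards in <meta name="twitter:...">. The class name, function name, and sample page are hypothetical.

```python
from html.parser import HTMLParser


class CardMetadataParser(HTMLParser):
    """Collects og:* and twitter:* <meta> values from an HTML document."""

    def __init__(self):
        super().__init__()
        self.metadata = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        # Open Graph uses the "property" attribute, Twitter Cards use "name".
        key = attr.get("property") or attr.get("name")
        content = attr.get("content")
        if key and content is not None and key.startswith(("og:", "twitter:")):
            # Keep the first value seen for each key.
            self.metadata.setdefault(key, content)


def extract_card_metadata(html):
    """Return a dict of card metadata found in the given HTML string."""
    parser = CardMetadataParser()
    parser.feed(html)
    return parser.metadata


# Hypothetical sample page, for illustration only.
SAMPLE_HTML = """
<html><head>
  <title>Example</title>
  <meta name="description" content="Plain description, not card metadata">
  <meta property="og:title" content="An Example Page">
  <meta property="og:description" content="Author-written summary">
  <meta name="twitter:card" content="summary">
</head><body></body></html>
"""

card_metadata = extract_card_metadata(SAMPLE_HTML)
```

The og:description value is the sort of author-written summary that can act as ground truth when evaluating a generated one.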
In 2020, we developed a special tool, MementoEmbed, for generating/extracting metadata from archived web pages. We presented this tool at the Web Archiving and Digital Libraries Workshop (WADL2020).
We found out that #Twitter, #Facebook, #Tumblr, and others could not reliably create cards for archived web pages. We use MementoEmbed’s cards in #Storytelling with our tool Raintale to create a #Visualization of this #Summarization.
#Twitter #facebook #tumblr #storytelling #visualization #summarization #webarchiving #digitalpreservation
I am just noticing that the French proto-social-media site Skyblog (now Skyrock) is shutting down, and that the Bibliothèque nationale de France (BnF) and the Institut national de l’audiovisuel (INA) were asked to archive the 19 million blogs, I guess using #webarchiving tech? (I need to find someone with a Le Monde subscription to read the rest of the article):
@fourjuaneight I wrote a blog post in 2019 in response to #Google+ going offline (https://ws-dl.blogspot.com/2019/02/2019-02-08-google-is-being-shuttered.html). My post, along with other sources, was used by the glorious all-volunteer ArchiveTeam (https://wiki.archiveteam.org), coordinated by @textfiles. They preserved as much of Google+ as they could before it was gone. They also have projects to preserve #Reddit, #Imgur, #Telegram, #GitHub, #YouTube, and many .ua #Ukraine sites.
#google #reddit #imgur #telegram #github #youtube #ukraine #twittermigration #digitalpreservation #webarchiving
📢We have a new #iipcGA23 #iipcWAC23 #iipc20Years wrap-up post on the blog: https://netpreserveblog.wordpress.com/2023/08/16/2023-general-assembly-and-web-archiving-conference-wrap-up/
🙌Thanks again to everyone who helped make this year's conference a success!
#webarchivering #webarchiving #iipc20years #IIPCWAC23 #iipcga23
#Twitter's pic issue is one of those moments demonstrating WHY WE NEED #DigitalPreservation & #WebArchiving: https://www.theguardian.com/technology/2023/aug/21/elon-musk-x-glitch-deletes-twitter-photos-pictures-links
It highlights the important work of:
@BrendaReyesAyala
@VickyRampin
@anj
@archivist_Liz
@azaroth42
@brewsterkahle
@edsu
@electricarchaeo
@euanc
@ibnesayeed
@ingridbmason
@internetarchive
@liblaura
@martinklein
@mickylindlar
@netpreserve
@peterwebster
@textfiles
@webrecorder
@weiglemc
And so many more. Please reply with others. #TwitterMigration
🎥#iipcWAC23 #iipc20Years recordings are now available to view!
🔵Check out the full list here on YouTube: https://youtube.com/@iipc8855/playlists?view=50&shelf_id=1
🔵Browse the final #iipcWAC23 program: https://netpreserve.org/ga2023/programme/wac/
🙌Thank you to all of our wonderful #iipcWAC23 presenters, session chairs, & co-authors for sharing their work with us!
#webarchivering #webarchiving #webarchives #iipc20years #IIPCWAC23
📢CfP: #WebArchives in Context📢
https://netpreserve.org/ga2024/cfp/
🟣PROPOSALS DUE SEPT 24
🟣#iipcWAC24 24-26 APR 2024
🇫🇷NATIONAL LIBRARY OF FRANCE (BnF), PARIS, FRANCE
🖥 #WebArchiving 🎉#iipc20years
💾#DigitalPreservation 📖#DigitalHumanities
Thanks very much to our Program Committee for their work in putting this CfP together! (https://netpreserve.org/ga2024/organization)
On 24 August 2023, a nestor event will cover "Web Archiving: Practice and Perspectives". Registration is still open until 4 August 2023! More information at: https://www.langzeitarchivierung.de/Webs/nestor/DE/Veranstaltungen_und_Termine/2023Webarchivierung.html
#digipres #webarchiving
UK Web Archive Technical Update - Summer 2023 - UK Web Archive blog #WebArchiving #WebArchiveWednesday https://blogs.bl.uk/webarchive/2023/07/ukwebarchivetechnicalupdate-summer2023.html
Today in dodgy #SpatialHumanities #WebArchiving bodges, I'd like to introduce the 'tile suck'.
So if you use most conventional automated archiving tools on a website with a leaflet map on it (in this example a #Curatescape Mapbox map for the @PortsPastPres website), you may notice that they only ever capture the map tiles visible on the screen.
Not good enough, I say! So my strategy is this:
1) Get the biggest screen you can find. Biiiiiiig. (zooming out on the browser doesn't seem to quite work)
2) Open up the map with the tileset that you want to hoover up into your WARC file (probably in Chrome, because that's what ArchiveWeb.page works on). Set your Leaflet or whatever map viewer to full screen. Start your crawl.
3) Pan around at every zoom level, concentrating on the areas of the map you care about. I'm using a point cluster map, so I focus on the areas where my points are. The big screen means that whenever you hit an undownloaded tile, it'll pull it in.
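If you want a rough idea of how many tiles a 'tile suck' has to pull in, here is a small sketch that is not part of the manual workflow above. It assumes the map uses the common XYZ ("slippy map") Web Mercator tile numbering found on OpenStreetMap-style tile servers; check that this holds for your tileset. The function names are made up for illustration.

```python
import math


def lonlat_to_tile(lon, lat, zoom):
    """Convert a WGS84 lon/lat to XYZ tile indices (x, y) at a zoom level."""
    n = 2 ** zoom  # number of tiles along each axis at this zoom
    x = int((lon + 180.0) / 360.0 * n)
    lat_rad = math.radians(lat)
    y = int((1.0 - math.asinh(math.tan(lat_rad)) / math.pi) / 2.0 * n)
    # Clamp so points on the map edge still land on a valid tile.
    return min(max(x, 0), n - 1), min(max(y, 0), n - 1)


def tiles_for_bbox(west, south, east, north, zooms):
    """Yield (z, x, y) for every tile touching the bounding box at each zoom."""
    for z in zooms:
        x_min, y_min = lonlat_to_tile(west, north, z)  # north-west corner
        x_max, y_max = lonlat_to_tile(east, south, z)  # south-east corner
        for x in range(x_min, x_max + 1):
            for y in range(y_min, y_max + 1):
                yield z, x, y


def count_tiles(west, south, east, north, zooms):
    """Count the tiles needed to cover the bounding box at the given zooms."""
    return sum(1 for _ in tiles_for_bbox(west, south, east, north, zooms))
```

Calling count_tiles with the bounding box around your points and the zoom levels your viewer offers gives a figure to compare against what actually ended up in the capture. Tile y indices grow southwards in this scheme, which is why the north-west corner gives the minimum x and y.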
First workshop on #webarchiving with #Webrecorder tools at #DH2023 con & very happy to be here for this convergence of #DH & webarchiving communities (hint: the former really needs more engagement with the latter ;)) with Ilya Kreymer & Jasmine Mulliken.
[PS At the con for the whole week, so if anyone here is in Graz, DM for meetups]
The NSZL uses a set of scripts and natural language processing tools to extract and clean the text from the archived web pages. #webarchiving #researchdata #LIBER2023
It took a long time to write it, and now you have to read it! A new blog post: Robust file transfers with Rclone https://anjackson.net/2023/07/04/robust-file-transfers-with-rclone/ #WebArchiving #DigiPres
Archives Research Compute Hub (ARCH):
a new research and education service that helps users easily build, access, and analyze digital collections computationally at scale
https://blog.archive.org/2023/06/26/build-access-analyze-introducing-arch-archives-research-compute-hub/
@thomas @internetarchive
Archives Unleashed (Ian Milligan et al), Mellon Foundation
#WebArchiving @histodons
Some reflections on IIPC Web Archiving Conference 2023: https://anjackson.net/2023/06/20/reflections-on-the-iipc-web-archiving-conference-2023/ #WebArchiving (cross-posted from UKWA blog).