Tim Sherratt · @wragge
1090 followers · 1001 posts · Server hcommons.social

📣 119,085 digitised newspaper articles added to Trove last week. Once again they're mostly (112,604) from the Sydney Daily Mirror, 1944-45. But there's also 6,481 added to the Kyabram Free Press and Rodney and Deakin Shire Advocate in 1954.

See the Trove Data Dashboard: wragge.github.io/trove-newspap

#trove #glam #histodons

Last updated 1 year ago

Tim Sherratt · @wragge
1087 followers · 996 posts · Server hcommons.social

So Trove has already used machine learning to improve the OCR of at least 10 million newspaper articles: nla-overproof.projectcomputing

#trove

Last updated 1 year ago

Tim Sherratt · @wragge
1087 followers · 996 posts · Server hcommons.social

Today in the Data Guide – I think the 'getting data from newspaper pages' section is nearly finished: wragge.github.io/trove-data-gu

#trove #wip

Last updated 1 year ago

Tim Sherratt · @wragge
1083 followers · 976 posts · Server hcommons.social

Documentation is hard. Every time I work on a section in the Data Guide I realise I need to update/create several other sections. It just keeps getting bigger. Anyway, nearly finished this 'HOW TO' on harvesting a complete set of search results using the Trove API: wragge.github.io/trove-data-gu

#trove #digitalhumanities #glam

Last updated 1 year ago
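
The general shape of harvesting a complete result set, as described in the post above, can be sketched as a cursor loop. This is a minimal illustration, not the Data Guide's code: `fetch_page` is a hypothetical callable you would write around your own API request, and the `"*"` starting cursor and the `None` end marker are assumptions made for the example.

```python
def harvest_all(fetch_page, start="*"):
    """Collect every record by following cursors until none is returned.

    fetch_page is a hypothetical callable: it takes a cursor and returns
    (records, next_cursor), with next_cursor set to None on the last page.
    """
    records = []
    cursor = start
    while cursor is not None:
        page, cursor = fetch_page(cursor)
        records.extend(page)
    return records


# Usage with a stand-in for real API calls (three records over two pages).
fake_pages = {"*": (["a1", "a2"], "next-1"), "next-1": (["a3"], None)}
print(harvest_all(lambda cursor: fake_pages[cursor]))
```

Passing the request function in keeps the loop independent of any particular HTTP client or API version.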

Tim Sherratt · @wragge
1083 followers · 976 posts · Server hcommons.social

I'm continuing to log bugs in the v3 API here: github.com/GLAM-Workbench/trov (Trove itself doesn't have any public list of issues/bugs)

#trove

Last updated 1 year ago

Tim Sherratt · @wragge
1084 followers · 969 posts · Server hcommons.social

@warpedtime If you can bear to use FB, there is an unofficial user group: facebook.com/groups/troveuserg

#trove

Last updated 1 year ago

Tim Sherratt · @wragge
1084 followers · 969 posts · Server hcommons.social

📣 Just like last week the only change to Trove's digitised newspapers in the past week has been the addition of more articles from the Sydney Daily Mirror – 123,565 articles from 1944-45.

See the Trove Newspaper Data Dashboard for more: wragge.github.io/trove-newspap

#trove

Last updated 1 year ago

Tim Sherratt · @wragge
1083 followers · 967 posts · Server hcommons.social

Just resubmitted a bug report from 2021 as it's still not fixed -- affects advanced search when filtering by holding organisation.

#trove

Last updated 1 year ago

Tim Sherratt · @wragge
1085 followers · 961 posts · Server hcommons.social

I've been playing around a lot with RO-Crate lately. It's a way of describing & packaging research data. Here's a post about how I've updated the Newspaper & Gazette Harvester to automatically document every harvest it creates using RO-Crate: updates.timsherratt.org/2023/0

#trove #researchinfrastructure #rocrate #glam #digitalhumanities

Last updated 1 year ago

Tim Sherratt · @wragge
1087 followers · 957 posts · Server hcommons.social

Not really much in this webinar to help people undertake new forms of digital research (which I thought was the point of the ARDC investment). But anyway, in a few more months the Data Guide will cover all of that and more (still much to do... 😬): wragge.github.io/trove-data-gu

#trove

Last updated 1 year ago

Tim Sherratt · @wragge
1088 followers · 955 posts · Server hcommons.social

Now talking about citations... Guess what? It would be a hell of a lot easier to capture and manage citations if Trove hadn't broken the Zotero translator with the 2020 update... 😡 (though Zotero still works with individual newspaper articles)

#trove

Last updated 1 year ago

Tim Sherratt · @wragge
1088 followers · 941 posts · Server hcommons.social

Tuning in to the "How to research on Trove" webinar. Includes an update on some recent ARDC-funded improvements to the API and web interface. Chat is disabled... so maybe I'll drop some comments here.

#trove

Last updated 1 year ago

Tim Sherratt · @wragge
1088 followers · 939 posts · Server hcommons.social

There's a new 'How to research' page on Trove, but I have to say it's a bit disappointing: trove.nla.gov.au/blog/2023/08/ Hopefully, I can fill in some gaps with the Trove Data Guide.

#trove

Last updated 1 year ago

Tim Sherratt · @wragge
1088 followers · 939 posts · Server hcommons.social

New version of the Newspaper Harvester section of the GLAM Workbench (v2.0.0). Now using v3 of the Trove API. glam-workbench.net/trove-harve

#trove #glamworkbench #glam #digitalhumanities

Last updated 1 year ago

Tim Sherratt · @wragge
1088 followers · 935 posts · Server hcommons.social

There's a new version of the Newspaper & Gazette Harvester Python package – now using v3 of the Trove API, and automatically generating an RO-Crate file to capture the details of each harvest. Use it as a library or a command line tool to harvest metadata, text, images & PDFs from thousands (even millions) of digitised newspaper articles.

Release details: github.com/wragge/trove-newspa

Full documentation: wragge.github.io/trove-newspap

#trove #glam #digitalhumanities #histodons

Last updated 1 year ago

Tim Sherratt · @wragge
1088 followers · 930 posts · Server hcommons.social

Aaand I've updated the GLAM Workbench's list of breaking changes in the API v3 with today's discoveries: glam-workbench.net/trove-api-v

#glamworkbench #trove #glam #digitalhumanities

Last updated 1 year ago

Tim Sherratt · @wragge
1088 followers · 928 posts · Server hcommons.social

Accidentally typed `arse_query` instead of `parse_query` and that's about how I'm feeling about the API update at the moment...

#trove

Last updated 1 year ago

Tim Sherratt · @wragge
1088 followers · 925 posts · Server hcommons.social

😡 Another undocumented, breaking change in v3 of the API. The `illtype` facet has been renamed `illustrationType`. Excuse me while I now go waste an hour or so updating the Trove API Console, the trove-query-parser etc...

#trove #digitalhumanities

Last updated 1 year ago
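
For anyone with old queries lying around, the `illtype` rename mentioned in the post above can be handled with a small shim. This is a rough sketch, not code from the Trove API Console or trove-query-parser: the function name and the mapping table are hypothetical, and the `l-<facet>` filter keys and comma-separated `facet` value are assumptions about how the query parameters are laid out.

```python
# Hypothetical table of renamed facets. Only the illtype -> illustrationType
# rename comes from the post; add other entries as you find them.
RENAMED_FACETS = {"illtype": "illustrationType"}


def rename_facets(params):
    """Return a copy of params with old facet names swapped for new ones.

    Assumes facet filters use keys like 'l-illtype' and that requested
    facets arrive as a comma-separated string under the 'facet' key.
    """
    updated = {}
    for key, value in params.items():
        if key.startswith("l-"):
            name = key[2:]
            key = "l-" + RENAMED_FACETS.get(name, name)
        elif key == "facet":
            names = [RENAMED_FACETS.get(n, n) for n in value.split(",")]
            value = ",".join(names)
        updated[key] = value
    return updated


old_query = {"q": "wragge", "l-illtype": "Photo", "facet": "decade,illtype"}
print(rename_facets(old_query))
```

The function returns a new dict and leaves the original untouched, so old queries can be kept for comparison.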