DigitalPebble Ltd · @digitalpebble
23 followers · 60 posts · Server fosstodon.org

@tallison @OpenSearchProject
Or just use ? :mastoinnocent:

#StormCrawler

Last updated 2 years ago

DigitalPebble Ltd · @digitalpebble
22 followers · 51 posts · Server fosstodon.org

Have added test coverage for

coveralls.io/github/DigitalPeb

As expected, it is pretty low on average, partly because writing tests for Bolts is not trivial, but at least we can now see where new tests should be added.

BTW are great

#StormCrawler #tests #opensource #contributions

Last updated 2 years ago

DigitalPebble Ltd · @digitalpebble
22 followers · 48 posts · Server fosstodon.org

Hoping to benchmark + with segment replication

#StormCrawler #opensearch

Last updated 2 years ago

DigitalPebble Ltd · @digitalpebble
22 followers · 48 posts · Server fosstodon.org

@davidshq
Definitely. To give an example, one of the top EU online retailers use but won't publicise (or sponsor) it. Their legal department advised them not to because it would expose the way they use it and that is seen as a risk.

#StormCrawler

Last updated 2 years ago

DigitalPebble Ltd · @digitalpebble
18 followers · 27 posts · Server fosstodon.org

We're super excited about being used by the project.

openwebsearch.eu/owler/

#StormCrawler #openwebsearch

Last updated 3 years ago

DigitalPebble Ltd · @digitalpebble
17 followers · 25 posts · Server fosstodon.org

Should we support tracing in ? Anyone using tools like Datadog when crawling to track slow URLs and bottlenecks?

#StormCrawler

Last updated 3 years ago

Tobias Zeumer · @vform
299 followers · 5307 posts · Server openbiblio.social

Missing Link: Open web index aims to make Europe independent in search

With the EU-funded development of an Open Web Index, researchers want to break the dominance of Google & Co. and broaden human knowledge.

heise.de/-7466867

OpenWebSearch.EU

Also

#searchengine #eu #owi #EuropeanOpenWebIndex #OWSAI #OpenWebSearchAndAnalysisInfrastructure #openwebsearch #suma #osf #serci #StormCrawler #gigablast #findx #quaero #Theseus #commoncrawls

Last updated 3 years ago

DigitalPebble Ltd · @digitalpebble
17 followers · 24 posts · Server fosstodon.org

We are pleased to announce that DigitalPebble Ltd is a partner of the OpenSearch Project.

In case you have missed it, has a module for since its latest release and hopefully there will be more good things to come!

opensearch.org/partners

#StormCrawler #opensearch

Last updated 3 years ago

DigitalPebble Ltd · @digitalpebble
16 followers · 19 posts · Server fosstodon.org

Call to all users: we will release a new version shortly so that people can benefit from the latest additions () and improvements (). Any chance you could test some crawls with the latest code in the main branch and report any issues? Thanks

#StormCrawler #opensearch #warc

Last updated 3 years ago

DigitalPebble Ltd · @digitalpebble
16 followers · 16 posts · Server fosstodon.org

Just opened a PR to port the content of the module of to

includes simple

Feedback welcome as usual

#elasticsearch #StormCrawler #opensearch #dashboards

Last updated 3 years ago

DigitalPebble Ltd · @digitalpebble
16 followers · 17 posts · Server fosstodon.org

A very nice contribution to improving the generation of files

github.com/DigitalPebble/storm

#StormCrawler #warc #webarchiving

Last updated 3 years ago

DigitalPebble Ltd · @digitalpebble
14 followers · 12 posts · Server fosstodon.org

There is a paradox with the sponsoring of : the only organisations that have financially supported our work are very small, typically with fewer than 5 employees. Meanwhile, larger ones (some of which have multi-million $£€ budgets and use SC on a large scale) neither donate nor contribute any code. Most of them are also very reluctant to publicly acknowledge their use of it. Is it down to the bureaucratic hassle of convincing ppl up the decision ladder? What do you think?

#StormCrawler

Last updated 3 years ago

DigitalPebble Ltd · @digitalpebble
14 followers · 12 posts · Server fosstodon.org

Now merged. This will be in the next release of

#StormCrawler

Last updated 3 years ago

DigitalPebble Ltd · @digitalpebble
14 followers · 12 posts · Server fosstodon.org

Fancy trying the new version of the archetype which uses as a backend?

github.com/DigitalPebble/storm

#StormCrawler #URLFrontier

Last updated 3 years ago