Ludovic · @usul
549 followers · 6710 posts · Server piaille.fr
Mathias Fußenegger · @mathias
148 followers · 366 posts · Server social.fussenegger.pro

Thanks to and the folks contributing to it, it only took a couple days to turn into a vector database.

It still boggles my mind seeing how much work you can save due to good libraries

#lucene #cratedb

Last updated 1 year ago

Dave Mackey · @davidshq
710 followers · 1006 posts · Server hachyderm.io

2/ targeted towards a more general audience. Thakare/Laddha/Pawar's Hybrid Intelligent Systems for Information Retrieval looks like a possibility...

Also open to books that approach from a specific implementation perspective, e.g. , , ...but most appear to be older / specific subtopic focused.

#search #ElasticSearch #solr #lucene

Last updated 1 year ago

LisPi · @lispi314
278 followers · 4089 posts · Server mastodon.top

@futurebird @Jirikiha You might be able to build something with (en.wikipedia.org/wiki/Apache_L) or, for a much more lightweight option, with (en.wikipedia.org/wiki/Xapian).

You'd still have to build something yourself from those though. For Xapian, looking at the source for and would probably be usable as a decent example (djcbsoftware.nl/code/mu/).

You'd also need to figure out some way to feed data exports from those into it.

All of my suggestions are & gratis.

#freesoftware #lucene #xapian #mu4e #mu

Last updated 1 year ago

Nemo_bis 🌈 · @nemobis
868 followers · 3365 posts · Server mamot.fr

@mage @andybaio Indeed. For similar reasons, it took over 10 years since its creation for Foundation to prioritise any serious investment on search: mediawiki.org/wiki/Extension:C

For the longest time, at WMF was maintained by a lone volunteer, river.

Nowadays there's an entire team which powers some of the best aware search in the web + some translation memory .
mediawiki.org/wiki/Wikimedia_S

Trey Jones' notes are a treasure trove.
mediawiki.org/wiki/User:TJones

#wikimedia #mediawiki #lucene #i18n #tm #nlp

Last updated 2 years ago

Nemo_bis 🌈 · @nemobis
843 followers · 3241 posts · Server mamot.fr

@wchr The quoted sentence from eff.org/deeplinks/2023/01/eff- is puzzling but it's commenting a hypothetical scenario where the changes are major: «substantive claims related to how their systems recommend, promote, rank, arrange, or otherwise display content posted by their users».
eff.org/files/2023/01/19/21-13

It's a scenario where misconfiguring your site's is a legal liability.

On the broader point, we still don't know that causes .
techdirt.com/2021/11/03/whole-

#lucene #youtube #radicalization #section230

Last updated 2 years ago

Johannes Schüth · @jotschi
13 followers · 26 posts · Server fosstodon.org

I added the face recognition project to my list of PoC's on GitHub. It now uses to lookup faces and to store them in a database. I wonder how long it would take to scan the FFHQ dataset. My jdlib fork still lacks mini-batch support for GPU image processing. Time for more JNI coding.😭

github.com/metaloom/poc-video4

#lucene #jooq

Last updated 2 years ago

Johannes Schüth · @jotschi
7 followers · 21 posts · Server fosstodon.org

I'm currently adding a module / API to video4j. The current implementation is using opencv. I'll try a CNN based face detector using dlib next. Should be much faster since it is GPU powered. I want to automatically extract embeddings and use kNN search with HnswGraph (Hierarchical Navigable Small World graph) to test face recognition / matching.

#facedetection #lucene

Last updated 2 years ago

masukomi · @masukomi
197 followers · 2579 posts · Server connectified.com

not knowing the backstory between & kept nagging at me, so i did a little google spelunking. Here's the short version:

's creator Damien Katz formed a company called CouchIO after CouchDB became an project.

CouchIO offered hosting + nice to haves like , geospacial indexing, etc.

CouchIO renamed themselves as CouchOne and released a mobile dev platform based on CouchDB and optomized for mobile devices.

🧵 1/?

#couchdb #couchbase #apache #lucene

Last updated 2 years ago

CrateDB · @cratedb
44 followers · 14 posts · Server fosstodon.org

Do you know how to write operations in ? 👀

In our new blog post, we will give you a throughout understanding of how writes new records 🤓 Learn the basic concepts of and the concept of 👇
hubs.ly/Q01w74N50

#cratedb #lucene #translog

Last updated 2 years ago

o19s · @o19s
14 followers · 3 posts · Server fosstodon.org

An to OpenSource Connections- we're a group of specialists in search engines such as , , & based across the US, UK and EU. We're known for the Manning book 'Relevant Search', the Haystack conference series and the 3000+ person Relevance Slack. Our mission is to Empower Search Teams to build more accurate & relevant search engines using data-driven, repeatable, hypothesis-based processes & techniques. We help make search better!

#introduction #opensource #lucene #solr #elasticsearch #opensearch

Last updated 2 years ago

o19s · @o19s
39 followers · 8 posts · Server fosstodon.org

An to OpenSource Connections- we're a group of specialists in search engines such as , , & based across the US, UK and EU. We're known for the Manning book 'Relevant Search', the Haystack conference series and the 3000+ person Relevance Slack. Our mission is to Empower Search Teams to build more accurate & relevant search engines using data-driven, repeatable, hypothesis-based processes & techniques. We help make search better!

#introduction #opensource #lucene #solr #elasticsearch #opensearch

Last updated 2 years ago

o19s · @o19s
14 followers · 3 posts · Server fosstodon.org

A quick introduction to OpenSource Connections- we're a group of specialists in search engines such as , , & based across the US, UK and EU. We're known for the Manning book 'Relevant Search', the Haystack conference series and the 3000+ person Relevance Slack. Our mission is to Empower Search Teams to build more accurate & relevant search engines using data-driven, repeatable, hypothesis-based processes & techniques. We help make search better!

#opensource #lucene #solr #elasticsearch #opensearch

Last updated 2 years ago

Charlie Hull · @flaxsearch
25 followers · 3 posts · Server hachyderm.io

So here's my - I work for OpenSource Connections (OSC) @o19s, we offer consulting on search engines - , , and now in the domain of Search Relevance - basically we help companies using these engines deliver the right results to their users. I'm currently heading up Marketing for OSC but I also help with sales, run our Haystack conference series, write, blog and present talks on search and run some customer projects.

#introduction #opensource #lucene #solr #ElasticSearch

Last updated 2 years ago

Dave Mackey · @davidshq
203 followers · 218 posts · Server hachyderm.io

@Josh412 I think so. 🙂 There are some great open source search engines out there already like (on which both and are built). I'm particularly interested in improving web search, while the number of contenders has increased, imho, they are all essentially competing on the same ML basis and thus can't be truly disruptive. I'd like to see a ML engine with human augmentation. This has been attempted several times before (Blekko, Wikia Search, Zakta, etc.)...

#lucene #ElasticSearch #solr

Last updated 2 years ago

Hrefna (DHC) · @hrefna
300 followers · 850 posts · Server hachyderm.io

On one end of the complexity scale we could have a single box setup:

1. or as a sort of reverse proxy.
2. Vertx as an HTTP server and Guava caches for caching.
3. A simple LinkedBlockingQueue as our queuing mechanism. Maybe with a to-disk write-ahead log.
4. The EventBus as a controller.
5. Vertx verticles as our processors.
6. as a database and post store.
7. On-disk media storage.
8. for search.

Here we've checked all of the boxes. It won't scale, however. 3/

#ktor #vertx #sqlite #lucene

Last updated 2 years ago

Cheatography · @cheatography
3 followers · 24 posts · Server botsin.space

Just released: Confluence search syntax Cheat Sheet by luisfe

Download it free at cheatography.com/luisfe/cheat-

Here's their description of it: Confluence's syntax to refine search results

@cheatsheets

#cheatsheet #cheatsheets #search #atlassian #lucene #confluence

Last updated 2 years ago

Dalatangi · @dalatangi
97 followers · 122 posts · Server fosstodon.org

If you're a java software engineer with a background in (especially or ) this job opening might be just for you jobs.elastic.co/jobs/elasticse

#search #elasticsearch #lucene

Last updated 2 years ago

Amy Fountain · @amyfou
526 followers · 633 posts · Server lingo.lol

anyone here use Elasticsearch or anything else based on Lucene, implemented rootless? There is a write.lock permissions error that we can't seem to shake

#Tech #nlproc #linguistics #elasticsearch #lucene #writelock #rootless

Last updated 2 years ago

Raf · @Raf
298 followers · 5913 posts · Server mastodon.social

RT @nknize@twitter.com

OK friends, @elastic@twitter.com geo is getting super exciting! After nearly 5 years, and many new spatial data structures & field types, you'll soon be able to index spatial data in its native CRS w/o reprojecting to WGS84 lat/lon!

🐦🔗: twitter.com/nknize/status/1170

#foss4g #lucene #gis #spatialIndexing #geoGeek

Last updated 5 years ago