Volkan Özçelik · @volkan
35 followers · 1223 posts · Server z2h.dev

Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON

github.com/johnkerl/miller

#miller #tools #DataAnalysis #terminal #apps #filter #search

Last updated 2 years ago

Volkan Özçelik · @volkan
35 followers · 1223 posts · Server z2h.dev

Orange is an open source machine learning and data visualization toolbox

orangedatamining.com/

#datamining #DataAnalysis #toolbox #orange #tools #datascience

Last updated 2 years ago

Volkan Özçelik · @volkan
22 followers · 936 posts · Server z2h.dev

OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.

github.com/cube2222/octosql

#sql #query #tools #octosql #data #DataAnalysis #join #merge

Last updated 2 years ago

Volkan Özçelik · @volkan
16 followers · 821 posts · Server z2h.dev

In case you manage a Mastodon instance, a query like

```
select DISTINCT tag_id, name from statuses inner join statuses_tags on statuses.id = statuses_tags.status_id inner join tags on statuses_tags.tag_id = tags.id where account_id=109326239149329075 ORDER BY name;
```

will give you a nicely sorted list of “tags use by a certain user”

That can be useful in a multitude of things.

#data #DataAnalysis #mastodon #tips #queries

Last updated 2 years ago

Volkan Özçelik · @volkan
16 followers · 821 posts · Server z2h.dev

Kangas 🦘 is a tool for exploring, analyzing, and visualizing large-scale multimedia data.

github.com/comet-ml/kangas

#data #datascience #DataAnalysis #tools

Last updated 2 years ago

Volkan Özçelik · @volkan
16 followers · 821 posts · Server z2h.dev

Tests for Continuous Validation of ML Models & Data. Deepchecks is a Python package for comprehensively validating your machine learning models and data with minimal effort.

github.com/deepchecks/deepchec

#python #machinelearning #Validation #libraries #data #DataAnalysis

Last updated 2 years ago