Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
https://github.com/johnkerl/miller
#miller #tools #DataAnalysis #terminal #apps #filter #search
Orange is an open source machine learning and data visualization toolbox
#DataMining #DataAnalysis #toolbox #orange #tools #dataScience
OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.
https://github.com/cube2222/octosql
#sql #query #tools #OctoSQL #data #DataAnalysis #join #merge
If you manage a Mastodon instance, a query like
```
select DISTINCT tag_id, name from statuses inner join statuses_tags on statuses.id = statuses_tags.status_id inner join tags on statuses_tags.tag_id = tags.id where account_id=109326239149329075 ORDER BY name;
```
will give you a nicely sorted list of “tags used by a certain user”, namely the account whose `account_id` appears in the `where` clause.
That can come in handy for a multitude of things.
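
To try the query without touching a live instance, here is a minimal sketch in Python. It assumes a toy, in-memory SQLite database as a stand-in for Mastodon's real PostgreSQL database: the three tables are cut down to just the columns the query uses, and the account ids, tag names, and the helpers `build_demo_db` and `tags_used_by` are made up for illustration.

```python
import sqlite3

# Hypothetical, heavily simplified stand-in for the three tables the query
# touches. A real Mastodon database has many more columns.
SCHEMA = """
CREATE TABLE statuses (id INTEGER PRIMARY KEY, account_id INTEGER);
CREATE TABLE tags (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE statuses_tags (status_id INTEGER, tag_id INTEGER);
"""

# Same query as in the post, with the account id passed as a parameter
# instead of being hard-coded.
QUERY = """
SELECT DISTINCT tag_id, name
FROM statuses
INNER JOIN statuses_tags ON statuses.id = statuses_tags.status_id
INNER JOIN tags ON statuses_tags.tag_id = tags.id
WHERE account_id = ?
ORDER BY name;
"""


def build_demo_db():
    """Create an in-memory database with a few made-up rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    # Account 42 wrote statuses 1 and 2, account 7 wrote status 3.
    conn.executemany("INSERT INTO statuses VALUES (?, ?)",
                     [(1, 42), (2, 42), (3, 7)])
    conn.executemany("INSERT INTO tags VALUES (?, ?)",
                     [(10, "tools"), (11, "data"), (12, "sql")])
    # Tag "data" is used twice by account 42; DISTINCT collapses it.
    conn.executemany("INSERT INTO statuses_tags VALUES (?, ?)",
                     [(1, 10), (1, 11), (2, 11), (3, 12)])
    return conn


def tags_used_by(conn, account_id):
    """Return (tag_id, name) pairs used by one account, sorted by name."""
    return conn.execute(QUERY, (account_id,)).fetchall()


if __name__ == "__main__":
    demo = build_demo_db()
    for tag_id, name in tags_used_by(demo, 42):
        print(tag_id, name)
```

Passing the account id as a bound parameter rather than pasting it into the SQL string makes it easy to reuse the same query for any account.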
#data #DataAnalysis #mastodon #tips #queries
Kangas 🦘 is a tool for exploring, analyzing, and visualizing large-scale multimedia data.
#data #datascience #DataAnalysis #tools
Tests for Continuous Validation of ML Models & Data. Deepchecks is a Python package for comprehensively validating your machine learning models and data with minimal effort.
https://github.com/deepchecks/deepchecks
#Python #MachineLearning #Validation #libraries #data #DataAnalysis