Taras Novak 🇺🇦 · @dataSamurai
143 followers · 155 posts · Server vis.social

Devs and data scientists really like the public EDA scripts, notebooks 📚 and data snapshots repository I created last October. That sample data/demo repository covers many different tools, libraries and notebooks to parse large data:

⭐️ github.com/RandomFractals/chic

📜 twitter.com/search?q=(%23Chica

🛠️ ...

#datatools #largedata #ChicagoCrimes

Last updated 3 years ago

Taras Novak 🇺🇦 · @dataSamurai
128 followers · 120 posts · Server vis.social

Quick demo of our new VSCode extension loading and querying 7,687,725 Chicago crime reports recorded from 2001 through the end of November 2022 from a large 1.68 GB CSV data file in seconds ... See the demo GIF at:

📰 github.com/RandomFractals/chic

💎💎💎

#datatools #vscode #sqltools #duckdb #ChicagoCrimes #duckdbsqltools

Last updated 3 years ago

Taras Novak 🇺🇦 · @dataSamurai
123 followers · 115 posts · Server vis.social

Our new VSCode extension is almost ready for prime time.

You'll be able to load remote CSV and Parquet data files via the httpfs extension and create in-memory DuckDB instances too.

See demo gif of loading parquet data from a GitHub repository into memory, creating a CrimeReports table, and querying it on twitter:

twitter.com/TarasNovak/status/

🔬💎💎💎...

#datatools #sqltools #vscode #ChicagoCrimes #duckdb #parquet #duckdbsqltools

Last updated 3 years ago

Taras Novak 🇺🇦 · @dataSamurai
94 followers · 99 posts · Server vis.social
Taras Novak 🇺🇦 · @dataSamurai
88 followers · 98 posts · Server vis.social

Updated the PyScript data app with a gzipped CSV (~3.25 MB). The app now loads 215,551 crime reports with Pyodide in a browser in about 8 seconds total for the Python runtime, data transformation with pandas 🐼 & charting with Altair 📊📈

randomfractals.github.io/chica

#altair #pandas #python #pyodide #dataapp #Pyscript #ChicagoCrimes

Last updated 3 years ago
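
A minimal sketch of the gzipped CSV load mentioned in the post above, assuming pandas is installed. The sample rows, the column names (`Date`, `PrimaryType`, `Arrest`) and the file name are hypothetical stand-ins, not the real dataset schema or the app's actual code. The app in the post runs in the browser, while this runs as a plain Python script.

```python
import gzip
import os
import tempfile

import pandas as pd

# Hypothetical sample rows; the real data snapshot has a different, larger schema.
SAMPLE_CSV = (
    "Date,PrimaryType,Arrest\n"
    "2022-01-01 00:15:00,THEFT,False\n"
    "2022-01-01 01:30:00,BATTERY,True\n"
    "2022-01-02 12:00:00,THEFT,False\n"
    "2022-01-03 08:45:00,ASSAULT,True\n"
)


def load_reports(path: str) -> pd.DataFrame:
    """Read a gzip-compressed CSV file into a DataFrame."""
    # pandas can infer gzip from the .gz suffix; it is spelled out here for clarity.
    return pd.read_csv(path, compression="gzip", parse_dates=["Date"])


def count_by_type(df: pd.DataFrame) -> pd.Series:
    """Count reports per PrimaryType, most frequent first."""
    return df["PrimaryType"].value_counts()


with tempfile.TemporaryDirectory() as tmp_dir:
    # Write the sample as a gzipped CSV, then read it back with pandas.
    csv_path = os.path.join(tmp_dir, "crimes-sample.csv.gz")
    with gzip.open(csv_path, "wt", encoding="utf-8") as gz_file:
        gz_file.write(SAMPLE_CSV)
    reports = load_reports(csv_path)

counts = count_by_type(reports)
print(counts)
```

Shipping the CSV gzipped mainly cuts download size; the parsing step is the same call either way.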

Taras Novak 🇺🇦 · @dataSamurai
88 followers · 97 posts · Server vis.social

Running some quick data summary queries with Malloy on a 2001-2022 Parquet data file that is 533 MB, created from a larger 1.66 GB CSV data file, without any compression. Very responsive and fast query execution thanks to DuckDB and the Malloy extension.

View those queries in action in this GIF: twitter.com/TarasNovak/status/

🛠️ ...

#datatools #vscode #duckdb #ChicagoCrimes #malloy

Last updated 3 years ago

Taras Novak 🇺🇦 · @dataSamurai
79 followers · 93 posts · Server vis.social

Our Data Preview 🈸 for VSCode now has over 350,000 installs. You can load large CSV files, sort & graph results with aggregate functions, and much more.

See an example of loading 48MB of CSV data: twitter.com/TarasNovak/status/

Note: change data.preview.theme to light. See: github.com/RandomFractals/vsco

📥 marketplace.visualstudio.com/i

📊📈 🛠️ for ...

#datascientists #datatools #dataviz #ChicagoCrimes #vscode #datapreview

Last updated 3 years ago

Taras Novak 🇺🇦 · @dataSamurai
79 followers · 91 posts · Server vis.social

so cool! :)
---
RT @TarasNovak
Created a web page with PyScript loading 2022 CSV data with pandas and visualizing that data with the Altair charting lib:
github.com/RandomFractals/chic
...
twitter.com/TarasNovak/status/

#Pyscript #ChicagoCrimes #pandas #altair #dataapps

Last updated 3 years ago

Taras Novak 🇺🇦 · @dataSamurai
77 followers · 89 posts · Server vis.social
Taras Novak 🇺🇦 · @dataSamurai
76 followers · 88 posts · Server vis.social

Created a web page with PyScript loading 2022 CSV data with pandas and visualizing that data with the Altair 📉📊 charting lib:

🧐 github.com/RandomFractals/chic

...

#dataapps #altair #pandas #ChicagoCrimes #Pyscript

Last updated 3 years ago

Taras Novak 🇺🇦 · @dataSamurai
74 followers · 87 posts · Server vis.social

So, I ported my old data wrestling with plots to the new repo. See this Jupyter notebook with many crime data summaries and plots from 2001 to present:

🧐 github.com/RandomFractals/chic

📚 ...

#datanotebooks #jupyternotebook #matplotlib #pandas #ChicagoCrimes

Last updated 3 years ago

Taras Novak 🇺🇦 · @dataSamurai
64 followers · 86 posts · Server vis.social

Hey data nerds 🤓, good news:

DuckDB v0.6.0 brings reading CSV data on par with Polars & PyArrow and loads 1.66 GB of data in 1.9s with 12 cores/24 threads when the experimental parallel CSV reader & unordered insertion are enabled.

🧐 github.com/RandomFractals/chic

🔬 ...

#datatools #ChicagoCrimes #polars #pyarrow #csv #duckdb #datanerds

Last updated 3 years ago

Taras Novak 🇺🇦 · @dataSamurai
63 followers · 84 posts · Server vis.social

Displaying parquet data with charts, imported table data source, measures, reusable queries, limits, nested grouping and bar chart renderer:

🔬 github.com/RandomFractals/chic

📊 ...

#datatools #datavis #vscode #malloy #ChicagoCrimes

Last updated 3 years ago

Taras Novak 🇺🇦 · @dataSamurai
52 followers · 83 posts · Server vis.social

Updated with tools example & more info in docs:

🧐 github.com/RandomFractals/chic

Clone that data repo & install Malloy extension to try it out:

📥 marketplace.visualstudio.com/i

🔬 ...

#datatools #vscode #MalloyData #ChicagoCrimes

Last updated 3 years ago

Taras Novak 🇺🇦 · @dataSamurai
47 followers · 82 posts · Server vis.social

I've decided to try Malloy today.

Here is a quick example of loading 2022 Parquet data with DuckDB and Malloy queries for some rough counts and data summaries:

github.com/RandomFractals/chic

🛠️ ...

#datatools #duckdb #parquetdata #ChicagoCrimes #MalloyData

Last updated 3 years ago

Taras Novak 🇺🇦 · @dataSamurai
47 followers · 82 posts · Server vis.social

So, you want to parse Parquet data with Polars like a Jedi?

🔭 this data loading repo: github.com/RandomFractals/chic

Pertinent thread on the ✴️: twitter.com/TarasNovak/status/

🛠️ ...

#datatools #deathstar #ChicagoCrimes #jedi #polars #parquet

Last updated 3 years ago

Taras Novak 🇺🇦 · @dataSamurai
47 followers · 82 posts · Server vis.social

I see some Observable fans here and others wanting to learn how to construct JS notebooks. You can explore our old 📚: observablehq.com/@randomfracta

#ChicagoCrimes #JSNotebooks

Last updated 3 years ago