I had recently posted on benchmarking the reading in of a .csv file but received an email over the weekend pointing out the omission of something like csv.gz file(s).
Functions tested in the benchmark:
✅ read.table
✅ read.csv
✅ fread
✅ vroom with altrep=false
✅ vroom with altrep=true
✅ read_csv
Post: https://www.spsanderson.com/steveondata/posts/rtip-2023-03-27/
#data #help #softwaredevelopment #compression #gz #r #rstats #vroom #datatable #readr #tidyverse #baser #opensource #innovation #technology #software #benchmarking
#benchmarking #Software #Technology #innovation #OpenSource #baser #tidyverse #readr #datatable #vroom #RStats #r #gz #compression #softwaredevelopment #Help #Data
An excellent cross-post by @nic_crane for https://dataalltheway.com on
Type inference in #readr and #arrow
https://dataalltheway.com/posts/012-type-inference-in-readr-and-arrow/
#rstats #statistics #datascience #bigdata #machinelearning #csv
#readr #arrow #rstats #statistics #datascience #bigdata #machinelearning #csv
@rstats@gup.pe
@jorge posted a quite interesting #webinar #shortcourse on how to handle data efficiently with #rstats
• data management plans
• version control
• R for reproducible data manipulation
• working on clusters
• data publication
#shateEGU20 #FAIRprinciples #tidyverse #dplyr #broom #tidyr #purrr #readr #ggplot2 #markdown #git #spatialdata
#webinar #shortcourse #rstats #shateEGU20 #FAIRprinciples #tidyverse #dplyr #broom #tidyr #purrr #readr #ggplot2 #markdown #git #spatialdata