Fonti · @fontikar
13 followers · 10 posts · Server fosstodon.org

@lukasjung @josi thanks Lukas! I’m going through some chapters in Advanced R and and some vignettes in ! Will definitely check out your recommendation with docs :)

#dplyr #rlang

Last updated 1 year ago

François Michonneau · @fmic_
484 followers · 74 posts · Server hachyderm.io

If you are wondering what the future of looks like, you might be interested in this article doi.org/10.14778/3603581.36036

The authors argue that despite increasing specialization, databases share the same components. By relying on standards, you can develop a composable system with:
- your preferred language interface (, , , ...). The query is converted into an intermediate representation (like )
- a query optimizer
- an execution engine
- an execution runtime

#databases #sql #ibis #dplyr #substrait

Last updated 1 year ago

Olivier Grisel · @ogrisel
1957 followers · 262 posts · Server sigmoid.social

Yesterday I learned at the tutorial that Ibis now offers an implementation of the across function first introduced in to conveniently and concisely apply transformations on a set of columns defined by selectors (e.g. based on column data types or name patterns).

This is especially convenient to implement scalable, in-DB feature engineering for machine learning models.

More examples in these blog post:

ibis-project.org/blog/selector

#euroscipy2023 #ibisdata #dplyr

Last updated 1 year ago

Imagine you have a bunch of data points and you want to know how many belong to different categories. This is where grouped counting comes in. We've got three fantastic methods for you to explore, each with its own flair: **`aggregate()`**, **`dplyr`**, and **`data.table`**.

Happy counting, fellow data explorer! 🎉🔍

Post: spsanderson.com/steveondata/po

#datatable #RStats #r #baser #aggregate #dplyr #exploredata #rprogramming #dataanalysis

Last updated 1 year ago

Kai 🐈‍⬛ · @kaio
65 followers · 278 posts · Server fosstodon.org

was having trouble with dplyr's `case_when` when I realized that all cases are evaluated eagerly. TIL.

#dplyr #rstats

Last updated 1 year ago

Marine Mas, PhD · @phdmas
38 followers · 112 posts · Server sciences.social

how do I know that my code is tiring me out? I stop resisting and use

#r #dplyr

Last updated 1 year ago

📢 Simplify Data Analysis in R: Easy Summarization Methods! 🚀

Are you an R enthusiast looking to streamline your data analysis? Look no further! Let's explore two fantastic ways to summarize data and gain valuable insights effortlessly! 📊

See attached!

Blog Post: spsanderson.com/steveondata/po

#innovation #Technology #OpenSourceCommunity #opensourcesoftware #summary #tibble #dplyr #r #tapply #HappyCoding #simplifyyourwork #datasummarization #DataScience #RStats #dataanalysis

Last updated 1 year ago

I encourage you to roll up your sleeves and give it a try yourself. 💪🔍

Read the full blog post and start your exploration. Let's dive in and level up your data analysis game! 🚀📊

spsanderson.com/steveondata/po

#DataManipulation #baser #dplyr #duplicatesdetection #dataanalysis #rprogramming

Last updated 1 year ago

Jesse Onland · @jdonland
37 followers · 75 posts · Server vis.social

Computing on a data frame in :

One value per row? mutate
One per group? summarize
Anything else: reframe

#tidyverse #dplyr

Last updated 1 year ago

Sharon Machlis · @smach
2138 followers · 28 posts · Server masto.machlis.com

“Intro to Data Science with R: Unlocking the power of R for public health data science”
New PDF to accompany free video courses by University of Geneva's GRAPH Network. Project supported by WHO.

drive.google.com/file/d/1Rw-hd

Video courses: thegraphcourses.org/courses-po
R Foundations, Data Wrangling with dplyr, with , Introduction to EpiGraphHub, and Intro to Geospatial Visualization with R

@rstats

#dataviz #ggplot2 #rstats #rspatial #dplyr

Last updated 1 year ago

Joshua de la Bruere · @delabj
375 followers · 423 posts · Server fosstodon.org

Hey friends I'm looking at wrapping a typical join in a function in a fairly opinionated way, however, I want to be able to add extra conditions to the by clause. (Sometimes I may have multiple ids while this function handles some work logic ) is there a good way to combine the results of join_by?

Is append? Enquo the arguments? Any help is great

#rstats #tidyverse #dplyr

Last updated 1 year ago

datamaps :rickwhoah: · @datamaps
409 followers · 798 posts · Server social.linux.pizza

@jospueyo @smach

well, people nowadays think that you need tons of dependencies to work (and is also so verbose I get bored only just to code a few ops)
it is actually ridiculous the amount of libraries it installs...

But I wait to comment after actually having tried it, I won't mind learn a new approach if I can reap some benefits from it

#dplyr

Last updated 1 year ago

François Michonneau · @fmic_
432 followers · 47 posts · Server hachyderm.io

A short blog post where I show how to use to connect to a remote file hosted over HTTPS and work with it using :

francoismichonneau.net/2023/06

#duckdb #parquet #dplyr

Last updated 1 year ago

François Michonneau · @fmic_
484 followers · 74 posts · Server hachyderm.io

A short blog post where I show how to use to connect to a remote file hosted over HTTPS and work with it using :

francoismichonneau.net/2023/06

#duckdb #parquet #dplyr

Last updated 1 year ago

Kit · @_kit
8 followers · 131 posts · Server fosstodon.org

I know we’re up to our eyeballs in goofs, but this one struck me as particularly puzzling.

Has there ever been a dplyr::compare (I have found no evidence)? Why aren’t there more arguments? Why is the third argument so specific? I’d love to know the provenance of the logic here. Cc: @hadleywickham

#chatgpt #rstats #dplyr

Last updated 2 years ago

Thomas Sandmann · @thomas_sandmann
296 followers · 735 posts · Server genomic.social

@MrHedmad If you are comfortable writing SQL then might be useful: duckdb.org/docs/data/csv/overv and if you are an person then you can query duckdb with , too.

#duckdb #RStats #dplyr

Last updated 2 years ago

Are you tired of manually copying and pasting data from multiple sheets in an Excel file? Say no more! In this post, we will show you how to use R programming to read in an Excel file with multiple sheets using a simple function.

Post: spsanderson.com/steveondata/po

#dplyr #tibble #softwaredevelopment #OpenSource #RStats #r #healthyr #readxl #Excel #dataanlysis #Data

Last updated 2 years ago

Mike Blazanin · @mikeblazanin
261 followers · 697 posts · Server ecoevo.social

I just released gcplyr v1.5.0 on github.

gcplyr is an package that makes it easy to wrangle and do model-free analyses of growth curve data

mikeblazanin.github.io/gcplyr/

v1.5.0 includes:
- fixes for critical bugs in derivative calculation
- improved handling of NA values, including several new functions
- new function for linear interpolation
- new args when calculating area under the curve

#rstats #microbial #microbiology #MicrobialEcology #MicrobialEvolution #ggplot2 #tidyverse #dplyr #data

Last updated 2 years ago

Looking to analyze FAANG stocks? Check out this exciting R Shiny app! Well it won't really give a full or even close to a full analysis but it shows how you can make a very simple shiny app to analyze some data.

Post: spsanderson.com/steveondata/po

#Data #learning #densityplot #stockreturns #logreturns #stocks #faang #shiny #dt #dplyr #tidydensity #tidyquant #Software #OpenSource #RStats #r

Last updated 2 years ago