Continuing my #DataScienceJourney, and this time I'm happy to share that I've completed my first PR feature contribution to an #opensource #Rstats package🥳
I added a feature to {janitor}, where `row_to_names()` can accept multiple rows as input.
Many thanks to Bill Denney and @samfirke for facilitating and guiding along the way.
You can see the PR process here: https://github.com/sfirke/janitor/pull/542
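A minimal sketch of the multi-row `row_to_names()` feature mentioned above. It assumes a {janitor} version that includes this feature (2.2.0 or later, as far as I know), and the data frame is made up for illustration:

```r
library(janitor)

# Hypothetical raw data: the first two rows together hold the header
raw <- data.frame(
  X1 = c("Group", "ID", "1", "2"),
  X2 = c("Score", "Math", "90", "85"),
  X3 = c("Score", "Art", "70", "95"),
  stringsAsFactors = FALSE
)

# Passing a vector of row numbers combines those rows into one set of names
clean <- row_to_names(raw, row_number = 1:2)

# Inspect the combined column names and the remaining data rows
names(clean)
clean
```

The header rows are dropped from the data by default, so only the two data rows remain.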
#datasciencejourney #opensource #rstats
Coming from software dev, I always had trouble seeing the benefit of Jupyter notebooks.
But after digging into EDA I can totally see the advantages of the tool. Being able to quickly go through datasets and document findings as I go is just great.
#DataScience #datasciencejourney
So I guess this is well past the time for an #introduction:
I'm Matan Hakim, a Policy Data Scientist. I'm Head of Research at the Israeli Institute for Cultural Policy, an NGO promoting cultural development.
I'm also a researcher at Arlozorov Forum, the research institute of the largest trade union in Israel.
I use #Rstats and mostly #Tidyverse, learning more every day. I post notes from my #DataScienceJourney as I go along.
You're welcome to follow if these are your fields🙏
#introduction #rstats #tidyverse #datasciencejourney
Continuing my #DataScienceJourney, today on bug handling.
I encountered a behavior I didn't expect with {ggpattern}. At first I created a *somewhat* minimal reprex, and posted it to @R4DSCommunity and {ggpattern}'s GitHub.
After realizing my problem was with #ggplot2, I filed an issue on its GitHub.
And only then, after minimizing the reprex as much as I could, I realized that this wasn't a bug but a mistake in my own code😭
The cliche about writing a reprex and finding the bug is true after all!
#datasciencejourney #ggplot2 #rstats
TIL that #dvc stage add will happily create and run stages with backslashes in the path
BUT the DVC #VSCode plugin and dvc plots will not be able to locate plots with backslashes in the path
Let's see if I can remember this gotcha until next time 🙈
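One way to sidestep the gotcha is to convert paths to forward slashes before handing them to DVC. A small base R sketch (the plot path is hypothetical):

```r
# Hypothetical plot path as it might be typed on Windows
plot_path <- "reports\\plots\\confusion.png"

# Replace every backslash with a forward slash
# (fixed = TRUE treats the pattern as a literal string, not a regex)
posix_path <- gsub("\\", "/", plot_path, fixed = TRUE)

posix_path
# "reports/plots/confusion.png"
```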
#DVC #vscode #datasciencejourney #MLops
Today's #DataScienceJourney post isn't about #Rstats, #DataScience or even #programming.
It's about #learning and about #community.
For the past few weeks I had trouble presenting text labels above my bar charts using #ggplot.
After a lot of time spent on #StackOverflow, I decided to turn to the @R4DSCommunity #Slack channel.
This turned out to be a great idea, as I got really good answers as well as guidance. I'll use this much more often in the future.
Thanks to Jack Penzer for the help
#datasciencejourney #rstats #datascience #programming #learning #community #ggplot #stackoverflow #slack
Today's #DataScienceJourney post is about tables in #Rstats, and specifically the {gtsummary}📦 package!
In an analysis I'm conducting I needed to create a quick descriptive statistics frequency table for all of the variables, and a simple linear regression model also including all variables as predictors.
As I found out, it is as easy as it gets. >>>
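A rough sketch of what those two tables can look like with {gtsummary}. This is not my actual analysis: it uses the `trial` example dataset that ships with the package, and the choice of columns and outcome is just for illustration:

```r
library(gtsummary)

# `trial` is an example dataset bundled with {gtsummary}, used as a stand-in
dat <- trial[, c("age", "marker", "grade", "trt")]

# Descriptive statistics for every column in the data frame
summary_tbl <- tbl_summary(dat)

# Linear regression with all remaining variables as predictors
fit <- lm(age ~ ., data = dat)
regression_tbl <- tbl_regression(fit)

# Print the tables
summary_tbl
regression_tbl
```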
After I caught up with the Data Science Hangout featuring Matt Dancho, hosted by @rachael from @Posit, I decided to take Matt's advice.
So I'll just start to share my thoughts and lessons from my #DataScienceJourney using #Rstats and #Tidyverse right here, without too much filtering or editing.
My first lesson is about using #TidyModels by @topepo, @hadleywickham and the rest of the team.
Today I tried to create a regression model while some of my predictors were on an ordinal scale.
>>>
#datasciencejourney #tidymodels #rstats #tidyverse