home.social

#tidyverse — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #tidyverse, aggregated by home.social.

  1. Wait! You can wrap pipelines in R in parentheses to directly use the last value?!?

    #rlang #tidyverse #R

  2. | Distributions – Physiscal | | Density Distribution of Temperatures in Central America, source: World Bank Climate Portal. Built with using , and .

  3. | Comparaciones – RSF-Data Day | . Cambio en score de libertad de prensa 2024–2025 en paises de América. Creada usando R con , , , , y .

  4. #Día6 | Comparaciones – RSF-Data Day | #30DayChartChallenge. Cambio en score de libertad de prensa 2024–2025 en paises de América. Creada usando R con #tidyverse, #ggtext, #scales, #rnaturalearth, #rnaturalearthdata y #patchwork.

  5. #Día6 | Comparaciones – RSF-Data Day | #30DayChartChallenge. Cambio en score de libertad de prensa 2024–2025 en paises de América. Creada usando R con #tidyverse, #ggtext, #scales, #rnaturalearth, #rnaturalearthdata y #patchwork.

  6. #Día6 | Comparaciones – RSF-Data Day | #30DayChartChallenge. Cambio en score de libertad de prensa 2024–2025 en paises de América. Creada usando R con #tidyverse, #ggtext, #scales, #rnaturalearth, #rnaturalearthdata y #patchwork.

  7. #Día6 | Comparaciones – RSF-Data Day | #30DayChartChallenge. Cambio en score de libertad de prensa 2024–2025 en paises de América. Creada usando R con #tidyverse, #ggtext, #scales, #rnaturalearth, #rnaturalearthdata y #patchwork.

  8. Having gotten my head fully around R pipes, I feel I need to write an article about it before I forget the fiddly details:

    |>, %>%, with(), lambdas, %$%, ...

    there are a lot of clever tricks for edge cases! (The with() hack I should have figured out for myself though!)

    Also, %$% -- the exposition pipe -- why didn't any of you tell me about this one!?!? You're slacking!

    #rlang #datascience #stats #statistics #rprogramming #tidyverse #tidydata #Rpipes #pipes!

  9. Ok, why do people keep doing this:

    library(tidyverse)
    library(lubridate)

    when the first call automatically loads the package in the second call? Am I missing something here?

    I see this **everywhere**!

    #rlang #rlanguage #tidyverse #lubridate

  10. [Перевод] Почему Python — не лучший язык для data science. Часть 2 — Python против R

    Команда Python for Devs подготовила перевод статьи о том, почему Python — несмотря на свою популярность — не всегда идеален для Data Science. Автор показывает, как отсутствие нестандартной оценки выражений усложняет анализ данных, и сравнивает Python с R, где такие задачи решаются куда элегантнее.

    habr.com/ru/articles/971372/

    #python #datascience #tidyverse #pandas #векторизация #polars

  11. Na #PythonCerrado2025, tivemos ontem um excelente tutorial do Lucas Marcondes Pavelski github.com/lucasmpavelski.

    Aprendemos sobre #R, #tidyverse, #reticulate, várias ferramentas essenciais como #ggplot2 e #dplyr, vendo na prática como aplicá-las. Foco na ponte #Python <-> R.

    Tudo novidade pra mim, vieram várias ideias interessantes de análises e plots.

    #PythonCerrado

  12. I was annoyed that there is no "expand_grid()" function in :python: #Python as in :rstats: #RStats #tidyverse

    So I just published a small package on #PyPI !

    Introducing polarsgrid
    pypi.org/project/polarsgrid/

    Using the excellent #polars 🐻‍❄️ package, easily create a table with product of factors:

    from polarsgrid import expand_grid
    expand_grid(a=[1, 2, 3], b=["x", "y"])

    Yields all combinations of its inputs as a #DataFrame

    It can also produce a #LazyFrame for streaming extra-big tables to disk

  13. I was annoyed that there is no "expand_grid()" function in :python: as in :rstats:

    So I just published a small package on !

    Introducing polarsgrid
    pypi.org/project/polarsgrid/

    Using the excellent 🐻‍❄️ package, easily create a table with product of factors:

    from polarsgrid import expand_grid
    expand_grid(a=[1, 2, 3], b=["x", "y"])

    Yields all combinations of its inputs as a

    It can also produce a for streaming extra-big tables to disk

  14. I was annoyed that there is no "expand_grid()" function in :python: #Python as in :rstats: #RStats #tidyverse

    So I just published a small package on #PyPI !

    Introducing polarsgrid
    pypi.org/project/polarsgrid/

    Using the excellent #polars 🐻‍❄️ package, easily create a table with product of factors:

    from polarsgrid import expand_grid
    expand_grid(a=[1, 2, 3], b=["x", "y"])

    Yields all combinations of its inputs as a #DataFrame

    It can also produce a #LazyFrame for streaming extra-big tables to disk

  15. I was annoyed that there is no "expand_grid()" function in :python: #Python as in :rstats: #RStats #tidyverse

    So I just published a small package on #PyPI !

    Introducing polarsgrid
    pypi.org/project/polarsgrid/

    Using the excellent #polars 🐻‍❄️ package, easily create a table with product of factors:

    from polarsgrid import expand_grid
    expand_grid(a=[1, 2, 3], b=["x", "y"])

    Yields all combinations of its inputs as a #DataFrame

    It can also produce a #LazyFrame for streaming extra-big tables to disk

  16. I was annoyed that there is no "expand_grid()" function in :python: #Python as in :rstats: #RStats #tidyverse

    So I just published a small package on #PyPI !

    Introducing polarsgrid
    pypi.org/project/polarsgrid/

    Using the excellent #polars 🐻‍❄️ package, easily create a table with product of factors:

    from polarsgrid import expand_grid
    expand_grid(a=[1, 2, 3], b=["x", "y"])

    Yields all combinations of its inputs as a #DataFrame

    It can also produce a #LazyFrame for streaming extra-big tables to disk

  17. At first glance, bar charts might seem like a simple visualization type. But with a little creativity, they can be enhanced in countless ways to reveal deeper insights and make your data shine.

    The attached visual highlights a variety of bar chart styles to inspire your work.

    Take a look here for more details: statisticsglobe.com/online-cou

    #datastructure #data #tidyverse #rstats #package #datasciencetraining

  18. Что в чёрной коробочке? Выясняем самостоятельно, не привлекая внимания коллег

    Всем привет, меня зовут Миша, и я разрабатываю платформу Яндекс Еды. Первые компоненты были написаны почти 10 лет назад (когда Еда ещё была стартапом Foodfox), и у нас накопилось много кода, который просто хорошо работает, а иногда даже «работает — не трогай». Но в процессе развития и устоявшиеся части системы нужно трогать, про что мои коллеги уже писали — как мы повышали версию PHP , пилили монолит и снимали нагрузку с БД . Наконец настал черёд рассказать про процессинг заказов доставки еды из кафе и ресторанов (а также продуктов из магазинов и многого другого). За годы эволюционного развития он значительно разросся, что стало заметно затруднять дальнейшее развитие — например, изменения, связанные с выходом на новые рынки, — а также влиять на надёжность. Поэтому мы решили вынести процессинг заказа в отдельный специализированный сервис. Чтобы определить, что выносить, а что оставлять, нужно было составить исчерпывающий и актуальный список процессов, которые происходят с заказом. И здесь мы столкнулись с вызовом: это знание распределено по многим людям и документам, поскольку на протяжении долгого времени в процессинг заказов вносили изменения многие команды. И перед нами встал вопрос — как собрать нужную информацию о системе с заметной долей легаси быстро, да так, чтобы информация была актуальна?

    habr.com/ru/companies/yandex/a

    #process_mining #анализ_данных #tidyverse #триз #рефакторинг #duckdb #архитектура_по #яндекс_еда #монолит

  19. Using dplyr and ggplot2 in R can significantly streamline your data analysis process, making it easier to work with complex data sets.

    I have created a video tutorial in collaboration with Albert Rapp, where I demonstrate how to do this in practice: youtube.com/watch?v=EKISB0gnue4

    #coding #datavisualization #rprogramming #dataviz #statisticalanalysis #package #datastructure #ggplot2 #bigdata #tidyverse

  20. I recently discovered the tidyplots package in R, and it’s impressive how effortlessly it enables you to create beautiful, publication-ready plots.

    The example visualizations shown here were created by the package author, Jan Broder Engler, and are featured on the tidyplots website: jbengler.github.io/tidyplots/

    Click this link for detailed information: statisticsglobe.com/online-cou

    #statisticsclass #datavisualization #advancedanalytics #rprogramminglanguage #visualanalytics #package #tidyverse

  21. I've talked about creating data.frames and tibbles before, but it is an important topic so I have covered it again. This time specifically from the perspective of creating them from vectors. Post: www.spsanderson.com/steveondata/... #R #RStats #tibble #dplyr #tidyverse #dataframe #baseR #blog

  22. I've talked about creating data.frames and tibbles before, but it is an important topic so I have covered it again. This time specifically from the perspective of creating them from vectors. Post: www.spsanderson.com/steveondata/... #R #RStats #tibble #dplyr #tidyverse #dataframe #baseR #blog

  23. I've talked about creating data.frames and tibbles before, but it is an important topic so I have covered it again. This time specifically from the perspective of creating them from vectors. Post: www.spsanderson.com/steveondata/... #R #RStats #tibble #dplyr #tidyverse #dataframe #baseR #blog

  24. I've talked about creating data.frames and tibbles before, but it is an important topic so I have covered it again. This time specifically from the perspective of creating them from vectors. Post: www.spsanderson.com/steveondata/... #R #RStats #tibble #dplyr #tidyverse #dataframe #baseR #blog

  25. I've talked about creating data.frames and tibbles before, but it is an important topic so I have covered it again. This time specifically from the perspective of creating them from vectors. Post: www.spsanderson.com/steveondata/... #R #RStats #tibble #dplyr #tidyverse #dataframe #baseR #blog

  26. Make your plots more stylish and visually appealing! The ggthemes package offers a variety of pre-built themes that help you customize the look of your ggplot2 visualizations, drawing inspiration from popular design standards.

    The visualization shown here is from the package website: yutannihilation.github.io/allY

    More: statisticsglobe.com/online-cou

    #datascienceeducation #coding #visualanalytics #tidyverse #ggplot2 #package

  27. Creating publication-ready plots in R is easier than ever with ggpubr. This extension for ggplot2 simplifies the process of generating clean and professional graphics, especially for exploratory data analysis and reporting.

    The attached visual, which I created using ggpubr, demonstrates its versatility.

    Additional information: statisticsglobe.com/online-cou

    #bigdata #visualanalytics #tidyverse #programming #statisticalanalysis #datavisualization #package #data #ggplot2

  28. If you are looking for data processors to get your data in line for the algo in question, then my #R #package { healthyR.ai } has you covered. These are based on using #tidymodels #parsnip from the #tidyverse www.spsanderson.com/healthyR.ai/... #RStats #Data #ModelData

  29. Day 10 | Distributions / Multi – Modal | . Visualization made with R using , , , , , and . Data source: Sentinel-2 MSI (2024)

  30. Day 7 | Distributions– Outliers | . Visualization made with R using , , , , y . Data source: Sentinel-2 MSI (2019-2024)

  31. Day 6 | Comparisons – Florence Nightingale (theme day) | . Visualization made with R using , and . Data source: HDX - data.humdata.org/dataset/cod-p.

  32. ggplot2 is the gold standard when it comes to data visualization.

    The image in this post showcases examples of ggplot2 visualizations, demonstrating its versatility to create a wide range of plots with nearly limitless customization options.

    Check out my online course, "Data Visualization in R Using ggplot2 & Friends," for a deeper dive into creating stunning plots with ggplot2.

    More info: statisticsglobe.com/online-cou

    #package #dataviz #statistical #tidyverse #pythondeveloperjobs

  33. Day 2 | Comparisons – Slope | . Analysis develop with R using , , , , , , y . Data source: Sentinel-2 MSI (2019-2024)

  34. gganimate is a powerful extension for ggplot2 that transforms static visualizations into dynamic animations. By adding a time dimension, it allows you to illustrate trends, changes, and patterns in your data more effectively.

    The attached animated visualization, which I created with gganimate, showcases a ranked bar chart of the top 3 countries for each year based on inflation since 1980.

    More information: statisticsglobe.com/online-cou

    #datastructure #datavisualization #tidyverse #ggplot2

  35. Understanding probability distributions is key to making informed decisions in statistics and data science. Probability distributions describe how the values of a variable are expected to behave, making them crucial for interpreting data and predicting outcomes.

    The visualization shown in this post illustrates the distributions.

    Further details: statisticsglobe.com/online-cou

    #dataanalytic #tidyverse #rstats #datavisualization

  36. Visualizing gene structures in R? gggenes, an extension of ggplot2, simplifies the process of creating clear and informative gene diagrams, making genomic data easier to interpret and share.

    Visualization: cran.r-project.org/web/package

    Click this link for detailed information: statisticsglobe.com/online-cou

    #datastructure #datavisualization #dataanalytics #data #tidyverse #datascientists #ggplot2

  37. Local regression is a non-parametric method for fitting smooth curves to data by applying multiple localized regressions. It is useful for uncovering non-linear relationships when the data’s exact form is unknown. Proper use of local regression can reveal trends in noisy data, but poor implementation might lead to misleading results.

    Image: en.wikipedia.org/wiki/Local_re

    More details: eepurl.com/gH6myT

    #database #package #bigdata #businessanalyst #tidyverse #datavisualization #rprogramming