home.social

#dataexploration — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #dataexploration, aggregated by home.social.

  1. The map-enabled data exploration tool *mapdata.py* (pypi.org/project/mapdata/) will now produce a Lorenz curve for any numeric variable, optionally segregated by any categorical variable.

    #DataAnalysis #DataExploration #DataViz #Plotting #LorenzCurve #Python #FOSS #FLOSS

  2. The map-enabled data exploration tool *mapdata.py* (pypi.org/project/mapdata/) will now produce a Lorenz curve for any numeric variable, optionally segregated by any categorical variable.

    #DataAnalysis #DataExploration #DataViz #Plotting #LorenzCurve #Python #FOSS #FLOSS

  3. The map-enabled data exploration tool *mapdata.py* (pypi.org/project/mapdata/) will now produce a Lorenz curve for any numeric variable, optionally segregated by any categorical variable.

    #DataAnalysis #DataExploration #DataViz #Plotting #LorenzCurve #Python #FOSS #FLOSS

  4. The map-enabled data exploration tool *mapdata.py* (pypi.org/project/mapdata/) will now produce a Lorenz curve for any numeric variable, optionally segregated by any categorical variable.

    #DataAnalysis #DataExploration #DataViz #Plotting #LorenzCurve #Python #FOSS #FLOSS

  5. The map-enabled data exploration tool *mapdata.py* (pypi.org/project/mapdata/) will now produce a Lorenz curve for any numeric variable, optionally segregated by any categorical variable.

    #DataAnalysis #DataExploration #DataViz #Plotting #LorenzCurve #Python #FOSS #FLOSS

  6. “Looking is not the same as seeing, after all, and this may be why some of us may stumble upon discoveries in data that others have already analyzed.” #stats #statistics #DataExploration

    RE: https://bsky.app/profile/did:plc:hfp3die5v5ojjt3bdm2luips/post/3mj4fdnzyms2j

  7. RE: floss.social/@rdnielsen/116365

    The third and last of the series on unmixing using NMF is now posted at

    dblog.vitumbre.tech/dart/unmix

    Part 3 illustrates the variability of results that can occur when repeatedly unmixing the same data set, and presents approaches to addressing the resultant uncertainty.

    #DataAnalysis #DataExploration #Unmixing #NMF #Python #JuliaLang #RStats

  8. RE: floss.social/@rdnielsen/116365

    The third and last of the series on unmixing using NMF is now posted at

    dblog.vitumbre.tech/dart/unmix

    Part 3 illustrates the variability of results that can occur when repeatedly unmixing the same data set, and presents approaches to addressing the resultant uncertainty.

    #DataAnalysis #DataExploration #Unmixing #NMF #Python #JuliaLang #RStats

  9. RE: floss.social/@rdnielsen/116365

    The third and last of the series on unmixing using NMF is now posted at

    dblog.vitumbre.tech/dart/unmix

    Part 3 illustrates the variability of results that can occur when repeatedly unmixing the same data set, and presents approaches to addressing the resultant uncertainty.

    #DataAnalysis #DataExploration #Unmixing #NMF #Python #JuliaLang #RStats

  10. RE: floss.social/@rdnielsen/116365

    The third and last of the series on unmixing using NMF is now posted at

    dblog.vitumbre.tech/dart/unmix

    Part 3 illustrates the variability of results that can occur when repeatedly unmixing the same data set, and presents approaches to addressing the resultant uncertainty.

    #DataAnalysis #DataExploration #Unmixing #NMF #Python #JuliaLang #RStats

  11. RE: floss.social/@rdnielsen/116365

    The third and last of the series on unmixing using NMF is now posted at

    dblog.vitumbre.tech/dart/unmix

    Part 3 illustrates the variability of results that can occur when repeatedly unmixing the same data set, and presents approaches to addressing the resultant uncertainty.

    #DataAnalysis #DataExploration #Unmixing #NMF #Python #JuliaLang #RStats

  12. RE: floss.social/@rdnielsen/116363

    Part 2 of this series on unmixing is now available:

    dblog.vitumbre.tech/dart/unmix

    Part 2 addresses the challenge of deciding how many end members are in a data set, recommends algorithms for Python, Julia, and R, and illustrates how several factors affect that determination.

    #DataAnalysis #DataExploration #Unmixing #NMF #Python #JuliaLang #RStats

  13. RE: floss.social/@rdnielsen/116363

    Part 2 of this series on unmixing is now available:

    dblog.vitumbre.tech/dart/unmix

    Part 2 addresses the challenge of deciding how many end members are in a data set, recommends algorithms for Python, Julia, and R, and illustrates how several factors affect that determination.

    #DataAnalysis #DataExploration #Unmixing #NMF #Python #JuliaLang #RStats

  14. RE: floss.social/@rdnielsen/116363

    Part 2 of this series on unmixing is now available:

    dblog.vitumbre.tech/dart/unmix

    Part 2 addresses the challenge of deciding how many end members are in a data set, recommends algorithms for Python, Julia, and R, and illustrates how several factors affect that determination.

    #DataAnalysis #DataExploration #Unmixing #NMF #Python #JuliaLang #RStats

  15. RE: floss.social/@rdnielsen/116363

    Part 2 of this series on unmixing is now available:

    dblog.vitumbre.tech/dart/unmix

    Part 2 addresses the challenge of deciding how many end members are in a data set, recommends algorithms for Python, Julia, and R, and illustrates how several factors affect that determination.

    #DataAnalysis #DataExploration #Unmixing #NMF #Python #JuliaLang #RStats

  16. RE: floss.social/@rdnielsen/116363

    Part 2 of this series on unmixing is now available:

    dblog.vitumbre.tech/dart/unmix

    Part 2 addresses the challenge of deciding how many end members are in a data set, recommends algorithms for Python, Julia, and R, and illustrates how several factors affect that determination.

    #DataAnalysis #DataExploration #Unmixing #NMF #Python #JuliaLang #RStats

  17. I just posted part 1 of a 3-part series on unmixing of data sets using non-negative matrix factorization.

    dblog.vitumbre.tech/dart/unmix

    Part 1 contains implementations in Python, Julia, and R, and includes an assessment of the relative accuracy of these implementations.

    Parts 2 and 3 will follow shortly, and will contain more detail on the identification of, and accurate characterization of, unmixing end members.

    #DataAnalysis #DataExploration #Python #JuliaLang #RStats #Unmixing #NMF

  18. I just posted part 1 of a 3-part series on unmixing of data sets using non-negative matrix factorization.

    dblog.vitumbre.tech/dart/unmix

    Part 1 contains implementations in Python, Julia, and R, and includes an assessment of the relative accuracy of these implementations.

    Parts 2 and 3 will follow shortly, and will contain more detail on the identification of, and accurate characterization of, unmixing end members.

    #DataAnalysis #DataExploration #Python #JuliaLang #RStats #Unmixing #NMF

  19. I just posted part 1 of a 3-part series on unmixing of data sets using non-negative matrix factorization.

    dblog.vitumbre.tech/dart/unmix

    Part 1 contains implementations in Python, Julia, and R, and includes an assessment of the relative accuracy of these implementations.

    Parts 2 and 3 will follow shortly, and will contain more detail on the identification of, and accurate characterization of, unmixing end members.

    #DataAnalysis #DataExploration #Python #JuliaLang #RStats #Unmixing #NMF

  20. I just posted part 1 of a 3-part series on unmixing of data sets using non-negative matrix factorization.

    dblog.vitumbre.tech/dart/unmix

    Part 1 contains implementations in Python, Julia, and R, and includes an assessment of the relative accuracy of these implementations.

    Parts 2 and 3 will follow shortly, and will contain more detail on the identification of, and accurate characterization of, unmixing end members.

    #DataAnalysis #DataExploration #Python #JuliaLang #RStats #Unmixing #NMF

  21. I just posted part 1 of a 3-part series on unmixing of data sets using non-negative matrix factorization.

    dblog.vitumbre.tech/dart/unmix

    Part 1 contains implementations in Python, Julia, and R, and includes an assessment of the relative accuracy of these implementations.

    Parts 2 and 3 will follow shortly, and will contain more detail on the identification of, and accurate characterization of, unmixing end members.

    #DataAnalysis #DataExploration #Python #JuliaLang #RStats #Unmixing #NMF

  22. The mapdata.py data explorer now has two new ways of summarizing missing data. It can also now create a Zipf's Law plot for any categorical variable. Install, update, or download it from PyPI: pypi.org/project/mapdata/

    #DataExploration #DataAnalysis #Mapping #Plotting #Statistics #FOSS #FLOSS

  23. 🔍 Exploring groundwater chemistry — from ions to equilibrium

    This ternary diagram shows how groundwater samples affected by mine water vary in anion composition. Each point represents one sample, colored by its calcite saturation index (SI) from PHREEQC calculations.

    Such early-stage exploration helps reveal subtle geochemical trends — where equilibrium breaks down, reactions intensify, and contamination fronts begin to form.

    🧪 Data exploration: PHREEQC + R

    #Geochemistry #Hydrogeology #MineWater #Groundwater #PHREEQC #DataExploration #EnvironmentalGeochemistry #GeochemicalModeling #DataVisualization #RStats #OpenScience #SvystunovaGully

  24. I'm not really sure when @micahflee made his Hacks, Leaks, and Revelations book free to read online, but if it's been on your wish list, now's your chance to give it a read, and if you enjoy it, and can afford to, support the author.

    hacksandleaks.com/contents.htm

    Buy here: hacksandleaks.com/

    #data #dataviz #DataVisualiaztion #DataExploration #books #HacksAndLeaks

  25. I'm not really sure when @micahflee made his Hacks, Leaks, and Revelations book free to read online, but if it's been on your wish list, now's your chance to give it a read, and if you enjoy it, and can afford to, support the author.

    hacksandleaks.com/contents.htm

    Buy here: hacksandleaks.com/

    #data #dataviz #DataVisualiaztion #DataExploration #books #HacksAndLeaks

  26. I'm not really sure when @micahflee made his Hacks, Leaks, and Revelations book free to read online, but if it's been on your wish list, now's your chance to give it a read, and if you enjoy it, and can afford to, support the author.

    hacksandleaks.com/contents.htm

    Buy here: hacksandleaks.com/

    #data #dataviz #DataVisualiaztion #DataExploration #books #HacksAndLeaks

  27. I'm not really sure when @micahflee made his Hacks, Leaks, and Revelations book free to read online, but if it's been on your wish list, now's your chance to give it a read, and if you enjoy it, and can afford to, support the author.

    hacksandleaks.com/contents.htm

    Buy here: hacksandleaks.com/

    #data #dataviz #DataVisualiaztion #DataExploration #books #HacksAndLeaks

  28. I'm not really sure when @micahflee made his Hacks, Leaks, and Revelations book free to read online, but if it's been on your wish list, now's your chance to give it a read, and if you enjoy it, and can afford to, support the author.

    hacksandleaks.com/contents.htm

    Buy here: hacksandleaks.com/

    #data #dataviz #DataVisualiaztion #DataExploration #books #HacksAndLeaks

  29. I'm continuing to play with my music listening data, and i suspect that spotify (2017-2021) and plex (2022+) handle time zones differently and that I'm not properly accounting for that difference.

    #DataExploration #PowerBI #MicrosoftFabric

  30. Fast and easy data exploration using Graphic-Walker Data Explorer with a Tableau-like drag & drop interface. Simply upload your CSV file and start exploring without writing code

    graphic-walker-data-explorer.n

    #dataviz #dataexploration #dashboard #rstats #rshiny #software

  31. New features in the data explorer MapData.py (pypi.org/project/mapdata/):

    1. Saturation, contrast, and brightness of basemap images can be customized.

    2. Hovering over a point on a scatter plot will display the label for that point.

    3. Data can be recoded to edit the values in an existing column or to add a column with new values to the data table.

    #Mapdata #Mapping #DataExploration #Python #FOSS

  32. When a spatial data set contains multiple values at a location (e.g., from different dates or depths/elevations), the number of data points at a location, and even the presence of multiple data points at a location, may not be apparent on a 2-D map. The latest version of MapData.py (pypi.org/project/mapdata/) addresses this situation in five ways:

    1/6

    #Mapping #MapData #DataAnalysis #DataExploration #DataPlotting #Python #FOSS

  33. I ran into issues with @geopandas explore today when the included datetime columns. Still have to check if this is a known issue and then create a reproducible example

  34. say one had a #database of books with 50k records in it. What would you use to explore / manipulate it?

    Thinking more along the lines of a GUI rather than pandas, etc.

    Extra points if it can take snapshots of the db on command and one can easily compare/merge (think git diffs) the snapshots.

    #question #TooSpecificMaybe #DataAnalysis #DataVisualization #DataExploration

  35. say one had a #database of books with 50k records in it. What would you use to explore / manipulate it?

    Thinking more along the lines of a GUI rather than pandas, etc.

    Extra points if it can take snapshots of the db on command and one can easily compare/merge (think git diffs) the snapshots.

    #question #TooSpecificMaybe #DataAnalysis #DataVisualization #DataExploration

  36. say one had a of books with 50k records in it. What would you use to explore / manipulate it?

    Thinking more along the lines of a GUI rather than pandas, etc.

    Extra points if it can take snapshots of the db on command and one can easily compare/merge (think git diffs) the snapshots.

  37. say one had a #database of books with 50k records in it. What would you use to explore / manipulate it?

    Thinking more along the lines of a GUI rather than pandas, etc.

    Extra points if it can take snapshots of the db on command and one can easily compare/merge (think git diffs) the snapshots.

    #question #TooSpecificMaybe #DataAnalysis #DataVisualization #DataExploration

  38. say one had a #database of books with 50k records in it. What would you use to explore / manipulate it?

    Thinking more along the lines of a GUI rather than pandas, etc.

    Extra points if it can take snapshots of the db on command and one can easily compare/merge (think git diffs) the snapshots.

    #question #TooSpecificMaybe #DataAnalysis #DataVisualization #DataExploration