home.social

#datascientists — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #datascientists, aggregated by home.social.

  1. The spatial join?: The end of the spatial join? Vikram Gundeti, CTO of #Foursquare, is reimagining #geospatial for #datascientists by eliminating traditional GIS hurdles and embracing #ML-friendly, agent-ready solutions. Will geodata be seamlessly accessible and...
    spatialists.ch/posts/2025/08/0 #GIS #GISchat #geospatial #SwissGIS

  2. Visualizing gene structures in R? gggenes, an extension of ggplot2, simplifies the process of creating clear and informative gene diagrams, making genomic data easier to interpret and share.

    Visualization: cran.r-project.org/web/package

    Click this link for detailed information: statisticsglobe.com/online-cou

    #datastructure #datavisualization #dataanalytics #data #tidyverse #datascientists #ggplot2

  3. Evaluating the normality of your data is crucial in statistical analysis, as many techniques assume that the data and/or residuals follow a normal distribution.

    The visualization in the post contrasts two QQ plots: the left plot shows a data set following a normal distribution, where the points align closely with the reference line.

    Check out this tutorial: statisticsglobe.com/r-qqplot-q

    Click this link for detailed information: statisticsglobe.com/online-cou

    #datascientists #datavisualization #data

  4. The employees also warned that many of those enlisted by #ElonMusk to help him slash the size of the federal government under #Trump’s admin were political ideologues who did not have the necessary skills or experience for the task ahead of them.

    The mass #resignation of #engineers, #DataScientists & #ProductManagers is a temporary setback for #Musk & the Republican president’s tech-driven #purge of the federal workforce.

    #law #USpol #FederalAgencies

  5. Discover how to implement hierarchical clustering in Python with our detailed tutorial. Perfect for #DataScientists and #ML engineers looking to master clustering algorithms. Includes code examples, visualizations, and practical applications. #Programming

    teguhteja.id/hierarchical-clus

  6. Principal Component Analysis (PCA) before Linear Regression can greatly enhance your data analysis process.

    By incorporating PCA before performing linear regression, you can streamline your analysis pipeline and build more robust models that capture the essential relationships within your data.

    I've developed an in-depth course on PCA theory and its application in R programming.

    Further details: statisticsglobe.com/online-cou

    #pythontraining #datascientists #data #bigdata #advancedanalytics

  7. I met the founder of the company and thought it was interesting to publish something about it.
    A New Approach To Training With Perforated Ai medium.com/@luismarcelobp/a-ne

  8. I invite #dataScientists and #computationalBiologists to contribute to Data All The Way! (dataalltheway.com). Share #tutorials, concepts, or projects (with code/Kaggle notebooks) under your name. I’ll help with editing and formatting. Contact me here or via the website to get started!

    Data All The Way

  9. Get ready for our first Metadata Sprint! On 8-9 April 2025 in Madrid, we're bringing together #librarians, #developers, #datascientists, and open infrastructure enthusiasts to co-create and innovate with Crossref #metadata and APIs. Whether you're pitching a project or joining a team, this is your chance to connect, collaborate, and create something impactful. Limited spaces—submit your abstract today. crossref.org/events/api-sprint

  10. Final Remainder it's happening Today:
    September 30, 2024 at 7:00PM CEST
    Data Wrangling Practice with R

    RSVP: meetup.com/rladies-rome/events

    @silacos @Rafagrlucas @fgazzelloni @RLadiesGlobal

  11. Happening Today:

    September 30, 2024 at 7:00PM CEST

    Data Wrangling Practice with R

    RSVP: meetup.com/rladies-rome/events

    @silacos @Rafagrlucas @fgazzelloni @RLadiesGlobal

  12. There is still time to register for an hands-on session on:
    Data Wrangling Practice with R

    When: Tomorrow, Monday September 30, 2024 at 7:00PM CEST

    RSVP: meetup.com/rladies-rome/events

    @silacos @Rafagrlucas @fgazzelloni @RLadiesGlobal

  13. Diving into Principal Component Analysis (PCA) unveils two heroes of data simplification: Eigenvalues and Eigenvectors. These mathematical concepts might sound intimidating, but they're crucial for understanding how PCA transforms complex data into something much more manageable. Let's demystify them:

    Looking to get hands-on with eigenvalues, eigenvectors, and PCA using the R programming language? Unlock the power of your data: statisticsglobe.com/online-cou

    #DataScientists #rprogramming

  14. #Anaconda puts squeeze on #datascientists deemed to be #ToS violators
    Academic, non-profit organizations now being told to pay up – or else
    "Research and non-profits are also the entities providing a lot of the repositories in the anacondaecosystem. I believe Anaconda are currently testing to see what happens if they play hardball with them."
    Source said interaction with company echoed Oracle’s tactics – it became clear licensing fees dating back years could be sought.
    theregister.com/2024/08/08/ana

  15. Such things could not be shared enough!
    - - -
    Dive into Deep Learning (free 1151-page PDF download provided by the author @smolix): alex.smola.org/projects.html
    - - -
    #BigData #DataScience #AI #ML #MachineLearning #DeepLearning #Algorithms #Mathematics #Calculus #NeuralNetworks #Python #Jupyter #DataScientists
    - - -
    via x.com/kirkdborne/status/180642

  16. 🦀 Generating Map Tiles with Rust - How easy is it to transition from Python to Rust?

    "My first foray into Rust was not nearly as difficult as I expected"

    towardsdatascience.com/generat

    #rustlang #python #programmers #DataScientists

  17. Attention #developers, #devops, and #datascientists: cybersecurity challenges touch every corner of the tech world. Are you equipped to handle them? Stay tuned for fresh content from OWASP throughout this year to help you do just that!

    youtube.com/watch?v=0UtvKRkfdq

  18. Hey, I've published a tutorial on how to draw an interactive treemap using the plotly package in the Python programming language. The tutorial was created in collaboration with Ifeanyi Idiaye: statisticsglobe.com/plotly-tre

    #datascientists #pythonprojects #dataanalytic

  19. Hey, I've published a tutorial on how to draw an interactive treemap using the plotly package in the Python programming language. The tutorial was created in collaboration with Ifeanyi Idiaye: statisticsglobe.com/plotly-tre

    #datascientists #pythonprojects #dataanalytic

  20. Hey, I've published a tutorial on how to draw an interactive treemap using the plotly package in the Python programming language. The tutorial was created in collaboration with Ifeanyi Idiaye: statisticsglobe.com/plotly-tre

    #datascientists #pythonprojects #dataanalytic

  21. and this egregious @nytimes survey that is as distant from provable fact as I have ever seen, and goes unchallenged by real #science and #datascientists:

  22. 🔜 🗓️ Our Quantum Computing Professional Training will take place from April 22 to 26. ✍ You can find all the information here, book your place and join us: 📝 fokus.fraunhofer.de/en/akademi

    👉 The course is designed for:
    📌 Software Engineers
    📌 Data Scientists
    📌 QC Researchers
    📌 Technology Scouts

    #FraunhoferITWM
    #QuantumComputing
    #SoftwareEngineering
    #DataScientists
    #TechnologyScouts

  23. What is ? A new and exciting set of skills, necessary for analyzing 21st century data? Or is it (as some have claimed) a rebranding of ?

    "Opinions on data science abound," say Jonathan Auerbach, David Kepplinger, and Nicholas Rios, but "few appear to be based on data or science."

    Auerbach et al. use two popular data science algorithms to examine the difference between , , and other occupations.

    Read more @rwdatasci:
    realworlddatascience.net/ideas

  24. Hey, I've published a tutorial that discusses alternatives to a Principal Component Analysis (PCA) when dealing with categorical data. The tutorial was created in collaboration with Paula Villasante Soriano & Cansu Kebabci: statisticsglobe.com/pca-catego

    #DataScientists #Coding

  25. Hey, I've created a tutorial on how to compare vectors and find differences in the R programming language. The tutorial shows five examples for functions such as identical(), intersect(), and setdiff(): statisticsglobe.com/compare-ve

    #DataScientists #DataScience #rprogramming

  26. Hey, I've created a video tutorial on how to plot frequencies on top of a stacked barplot using the ggplot2 package in the R programming language: m.youtube.com/watch?v=zVsfWysT

    #DataScientists #Package #Programming #ggplot2 #rprogramming

  27. Having #DataScientists Build Infrastructure & Developing Models At The Same Time Is A Terrible Anti-Pattern We’re Addicted To.

    Esp at comps that aren’t early stage -- correlated w/ a lack of technical DS leadership, poor infra design, and lack of organizational alignment.

    Really shows how the difference between success & failure isn’t technology choices but good project management & strategic leadership around platforms.

    #mlops #mlengineering #mlplatforms #datascience

  28. Hey, I've created a video tutorial on how to draw multiple boxplots in the same graph using the R programming language. The tutorial shows examples for Base R, ggplot2 & the lattice package: m.youtube.com/watch?v=K-GKhyy8

    #tidyverse #package #rstats #datavisualization #datascientists #ggplot2 #dataanalytic

  29. I'm excited to see #gradientboosting making some news! There is so much #aihype around #llms (and before that it was #deeplearning) but I think that for most #datascientists working in industry the development of #gradientboosting #machinelearning algorithms (like #xgboost and #catboost) are the real revolution and will have a much more long lived impact on our work.

    nature.com/articles/s41598-022

  30. RT @BazeleyMikiko: 🤔 Do you think one of the reasons why your #datascientists aren't adopting your internal #mlops or #productionml tools is b/c the interface is hard-to-use

  31. 🤔 Do you think one of the reasons why your #datascientists aren't adopting your internal #mlops or #productionml tools is b/c the interface is hard-to-use

  32. 🐍🌞 Having some free time this summer and want to learn #Python? I’m happy to share the material 📚 of our past course on Python Basics for #DataScientists. It’s designed to start from scratch 🚀, ready for #selfstudy, and requires no prior programming knowledge. Feel free to use and share it 😙

    🌎 fabriziomusacchio.com/teaching

    #LearnPython #SummerLearning #DataScience

  33. Hey, I've published a video tutorial on how to create interactive boxplots using the plotly package in the R programming language. The tutorial was created in collaboration with Kirby White: m.youtube.com/watch?v=ndn58q9c

    #visualanalytics #rprogramminglanguage #datascientists #package

  34. We need fewer #DataScientists and more #DataAnalysts if we ever hope to have useful #AI / #MachineLearning. If you plow whatever cheap and easy data into the training set, then "garbage in, garbage out". Creating useful and quality data sets for training and validation takes extensive human effort and is a very worthwhile pursuit.

  35. Our #DataPreview 🈸 for #vscode now has over 350,000 installs. You can load large CSV files, sort & graph results with aggregate functions, and much more.

    See an example of loading 48MB of #ChicagoCrimes CSV data: twitter.com/TarasNovak/status/

    Note: change data.preview.theme to light. See: github.com/RandomFractals/vsco

    📥 marketplace.visualstudio.com/i

    #dataViz 📊📈 #dataTools 🛠️ for #dataScientists ...

  36. Comet AI nabs $4.5M for more efficient machine learning model management - As we get further along in the new way of working, the new normal if you will, finding more efficien... more: feedproxy.google.com/~r/Techcr #artificialintelligence #machinelearning #newyorkstartups #datascientists #enterprise #startups #funding #cometai #cloud

  37. I made a dumb little meme to try & prompt a bit of conversation about how to bridge the #skillsgap with #Apprenticeships

    I really want to connect with #UK folk who need #DataScientists (including #Bioinformatics & #ComputationalBiology) #LabScientists #SeniorLeaders #DigitalMarketers & #Managers

    I hear a lot about #skills shortages, but see less action to address them.

    I've got access to a big part of the puzzle, but I need exceptional partners to work with on it.

  38. Learn why Teal is the tool for , , and drug safety specialists looking to gain insights from their clinical trial data. | Register today for the free webinar (Feb 23 at 8 am PST) ➡️ r-consortium.org/webinars

  39. #bioinformatics #computationalbiology #DataScientists #cancer_research #drugdiscovery #Omics #Transcriptomics #Genomics People.

    We've refreshed and updated our undergraduate Bioinformatics #Apprenticeship course and are seeking partners who want to train new #Bioinformaticians in their #UK (well, English..) workforce, there's more info here and if you like what you see and want to know more just get in touch.

    aru.ac.uk/study/degree-apprent

  40. Hey, I've created a video tutorial on how to calculate certain summary statistics for a DataFrame using the pandas library in the Python programming language: m.youtube.com/watch?v=bdygwr2D

    #statistics #pythoncourse #datascientists