home.social

1000 results for “Sarah_Lea”

  1. There is no need to move data. Data latency is minimised. Data can be transformed and analysed within a single platform.

    Let me know what you know about Zero-ETL :blobcoffee:

    "Why ETL-Zero? Understanding the shift in Data Integration" by Sarah Lea on Medium: medium.com/towards-data-scienc

    #python #datalake #cloudcomputing #etl #zeroetl #salesforce #data #tech #technology #datawarehousing #datalakehouse

  2. Anyone working with business intelligence, data science, data analysis, or cloud computing will have come across SQL at some point. Take a deep dive into data lakehouses, SQL, data modeling + more in Sarah Lea's latest article.

    #DataLakehouse

    towardsdatascience.com/sql-and

  4. Regex vs. LLM for B2B document extraction. This week, I tried out both.

    :blobcoffee: The rule-based pipeline with pytesseract + regex worked perfectly for Layout A. For Layout B? Every single field returned None.

    :blobcoffee: Because "PO Number" and "Order Reference" are the same thing for a human. Not for a regex pattern.

    :blobcoffee: The LLM-based approach (pytesseract + Ollama + LLaMA 3) extracted both layouts correctly, without touching a single rule. It even normalized the date format automatically.

    :blobcoffee: But LLMs aren't always the right answer. If your documents are stable, speed matters at scale, or explainability is required, regex might still win.

    Full comparison with code and trade-off breakdown on TDS: shorturl.at/v4gdl

    #Python #DataScience #business #technology #dataengineering #LLM #Automation #OCR
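
The Layout A vs. Layout B failure described in this post can be reproduced without any OCR step. The following is a minimal sketch, not the author's pipeline: the two sample texts, the field names, and the `extract` helper are invented for illustration, and the strings stand in for what pytesseract might return.

```python
import re

# Hypothetical OCR output for two document layouts (invented sample text).
LAYOUT_A = "PO Number: 4500123\nDate: 2024-03-15\n"
LAYOUT_B = "Order Reference: 4500123\nIssued on 15.03.2024\n"

# Rules written against the labels used in Layout A only.
PATTERNS = {
    "po_number": re.compile(r"PO Number:\s*(\d+)"),
    "date": re.compile(r"Date:\s*(\d{4}-\d{2}-\d{2})"),
}


def extract(text):
    """Return the first capture group per field, or None if the rule does not match."""
    result = {}
    for field, pattern in PATTERNS.items():
        match = pattern.search(text)
        result[field] = match.group(1) if match else None
    return result


print(extract(LAYOUT_A))  # both fields are found
print(extract(LAYOUT_B))  # both fields are None: the labels differ
```

Supporting Layout B with rules alone means adding a pattern for every new label and date format, which is the maintenance cost the post weighs against the speed and explainability of regex.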

  8. Most ML issues are not model problems. They are data problems.

    I retrained the same churn model twice.
    Same code. Same path to the data.
    Different result.

    Why? Because of mutable data references.

    :blobcoffee: I wrote a small Data Lake vs Data Lakehouse demo showing why versioned data makes ML debugging reproducible: tinyurl.com/lake-vs-lakehouse-

    :blobcoffee: Friend-Link: medium.com/towards-artificial-

    #ai #machinelearning #data #lakehouse #warehouse #python #datalake #technology #regression
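
The "same code, same path, different result" effect can be shown with a toy example. This is a sketch under stated assumptions, not the linked demo: the in-memory `lake` and `snapshots` dictionaries, the `customers/latest` path, and the `train` stand-in (which only computes a churn rate) are all hypothetical.

```python
import hashlib
import json

lake = {}       # mutable reference: one path, overwritten in place
snapshots = {}  # immutable references: content hash -> serialized rows


def write_lake(path, rows):
    """Overwrite whatever is stored at the path."""
    lake[path] = json.dumps(rows)


def commit_snapshot(rows):
    """Store rows under an id derived from their content and return that id."""
    payload = json.dumps(rows, sort_keys=True)
    version = hashlib.sha256(payload.encode()).hexdigest()[:12]
    snapshots[version] = payload
    return version


def train(payload):
    """Stand-in for model training: the churn rate of the given rows."""
    rows = json.loads(payload)
    return sum(r["churned"] for r in rows) / len(rows)


rows_v1 = [{"id": 1, "churned": 1}, {"id": 2, "churned": 0}]
rows_v2 = rows_v1 + [{"id": 3, "churned": 1}, {"id": 4, "churned": 1}]

write_lake("customers/latest", rows_v1)
v1 = commit_snapshot(rows_v1)
first_run = train(lake["customers/latest"])

# An upstream job overwrites the same path between the two training runs.
write_lake("customers/latest", rows_v2)
v2 = commit_snapshot(rows_v2)
second_run = train(lake["customers/latest"])

# Pinning the version id reproduces the first run exactly.
rerun_pinned = train(snapshots[v1])

print(first_run, second_run, rerun_pinned)
```

Recording the version id alongside each training run is what makes the rerun comparable; the path alone says nothing about which rows were read.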

  9. In a data warehouse you store structured and organized data. In a data lake you can additionally store unstructured data. And what is a data lakehouse?

    Think of a combination of the strengths of both previous data platforms. :blobcoffee:

    towardsdatascience.com/sql-and

    #data #DataEngineering #datalakehouse #datacenters #datawarehouse #datalake #datascience #sql

  10. THE COMPLETE DETECTIVE SARAH BURKE books "Strong female lead. Southwestern heat. Dark secrets buried deep" Sale: $4.99 to $1.99 by ELIZABETH GUNN Rating: 4.6/5 (1,515 Reviews) #Mystery #Thriller #Crime #PoliceProcedural #FemaleLead #Tucson #Kidnapping #Books #BoxSet #BookSky

    THE COMPLETE DETECTIVE SARAH B...

  14. Call MN DNR commissioner Sarah Strommen at 651-259-5555 and tell her to protect the BWCA by canceling Twin Metals' leases. You can also tell Tim Walz the same. #BWCA #BoundaryWaters #MN #Minnesota #ElyMN

  17. Call MN DNR commissioner Sarah Strommen at 651-259-5555 and tell her to protect the BWCA by canceling Twin Metals' leases. You can also tell Tim Walz the same. They both have the power to cancel these leases, especially since the company failed to meet the production and royalty requirements mandated by those contracts!
    #BWCA #BoundaryWaters #MN #Minnesota #ElyMN

  18. 🆕 🎦 New learning videos published:

    ▪️ 3D Digitization & Visualization of Insects:
    tinyurl.com/mshey49n by @heethoff from the @TU

    ▪️ Learning to use Galaxy - Introduction and Hands-on:
    tinyurl.com/rz6ab5eu by Sarah Büker from the DSC of @unibremen

    👩‍🎓 Learn at your own pace with more than 30 videos on our channel: https://tinyurl.com/2sexbvm9

    🔜 New videos every few weeks.

    #winoda #OnlineCourse #Webinar #datascience #datamanagement #3d #insects #Galaxy

  19. Picture book author Sarah Speedie goes from strength to strength

    Sarah Speedie was a full-time stay-at-home mum. When her children reached an age where she could carve out time for herself, she chose to embark on a new journey, one that would reignite her passion for learning and storytelling. Sarah enrolled in the Writing Picture Books course at the Australian Writers' Centre.
    writerscentre.com.au/blog/sara

    #AlumniStudentsuccessstories #Fictionwriting #creative #graduatesuccess

  23. Today I learned that the Archbishop of Canterbury, during their confirmation ceremony, does not pledge allegiance to God, nor to the Church, but to the "Reigning King of England and all their heirs and successors", which... yeah, says a lot about British history right there.

    BBC News:
    "Dame Sarah Mullally has been officially confirmed as the 106th Archbishop of Canterbury in a ceremony rich with centuries-old tradition at St Paul's Cathedral."

    #BBC #Protestant #Church #Religion

  24. "The inviolability of the home also applies in initial reception facilities, and refugees have a right to privacy there" - Sarah Lincoln of @Freiheitsrechte

    Interview with Ben from #AktionBleiberecht & #LEA-Watch about the #VGH decision.

    rdl.de/beitrag/zimmer-erstaufn

  25. Like many #Episcopalians, I was thrilled when Dame Sarah Mullally became the first female #archbishopofcanterbury. I watched with considerable interest when she visited the Vatican and #PopeLeo, knowing that their meeting was a diplomatic and theological threading of a needle for the leader of the #CatholicChurch, even after the Pope’s historic meeting with #KingCharlesIII. Now I am reading about U.S. Secretary of State #MarcoRubio’s trip to Rome, and the contrast in welcome is sharp. Learn how and why in THE POPE DIDN’T PRAY WITH HIM: notd.io/notes/6239800789827584
