#datalakes — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #datalakes, aggregated by home.social.
-
Data is not storage—it’s a product. What are you building? #DataProducts #DataLakes #DataMonetisation #AI #Cloud #BigData #DigitalTransformation #CIO #CTO #Leadership #Innovation #DataStrategy #AIethics
https://medium.com/@sanjay.mohindroo66/from-data-lakes-to-value-streams-building-data-products-that-matter-4fbd6b1c1ff2 -
Data is not storage—it’s a product. What are you building? #DataProducts #DataLakes #DataMonetisation #AI #Cloud #BigData #DigitalTransformation #CIO #CTO #Leadership #Innovation #DataStrategy #AIethics
https://medium.com/@sanjay.mohindroo66/from-data-lakes-to-value-streams-building-data-products-that-matter-4fbd6b1c1ff2 -
🚀 Quack, quack! Introducing DuckLake: the "innovative" format that reinvents the wheel by combining data lakes and catalogs. 🎉 It's like your SQL database got lost in a pond, but hey, at least it's open-source and uses #Parquet files! 🦆💾
https://ducklake.select/ #DuckLake #DataLakes #OpenSource #Innovation #HackerNews #ngated -
Security data lakes and data warehouses are repositories that enable organizations to store large amounts of security data — typically types not immediately required for search and analysis. Is it time for your org to build a security data lake strategy? 🏗️ Let's explore some of the important details about security data management. 👀
Read on to learn about data lake architecture, the benefits of using #security data lakes, some key strategy considerations, and a cost-effective solution to security data management. 💵 🔒 🙌
https://graylog.org/post/security-data-lake-strategy/ #datalakes #cybersecurity
-
"Data lakes have been around for years, and they may hold the key to efficient and effective #SIEMs by forcing behavioral changes based on economics." 💡
In this insightful article, #Graylog's Joshua Ziel explains what orgs need to know when it comes to how data and data storage affect cost and the resulting impact on their #cybersecurity strategy.
Wondering what the future might hold when it comes to the role of data lakes in #SIEM? Read on to find out. 📖
https://graylog.info/3Pm4aBu #security #datalakes -
Magnifique #stage #Master Projet PICLETTERS « Pablo Picasso en toutes lettres » Modélisation et enrichissement des métadonnées des correspondances de Picasso🤩 au @labo_eric #Lyon1 #Lyon2 https://eric.msh-lse.fr/15-12-23-stage-modelisation-et-enrichissement-des-metadonnees-des-correspondances-de-picasso-web-machinelearning/
#datalakes #bigdata #digitalhumanities #IA #machinelearning #metadata #semanticweb #web -
@briankrebs The best way to prevent #dataexfiltration when breached is not to collect or store unnecessary data in the first place. That makes many of the current spate of #databreaches avoidable, self-inflicted incidents for which large companies are never held accountable in any truly meaningful way.
You're spot on when you say that #databrokers rely on large #datalakes of sensitive data they don't need directly. They also rely on large data sets where any typical datum may be harmless in itself, but often becomes sensitive or dangerous when aggregated, and often exponentially more so when connected to intrinsically sensitive data such as #PII, #PHI, or identity.
Setting aside the financial incentives and lack of accountability for the data brokers, how do #businessleaders, #regulatoryagencies, and #electedpoliticians justify this state of affairs to you? It's not like the public and private sectors don't also have data they want to protect, so why allow this shadow industry to prosper? This seems even more mystifying when it's so clearly a double-edged sword even for the brokerages' paying customers!
-
Improving Security Data Lake Efficiency with Log Filtering: https://jacknaglieri.substack.com/p/filtering
-
In another form of platform vs. point tools, Anvilogic "deconstructs" SIEMs by separating analysis from data collection & storage, thus enabling organizations to separately optimize sensors, data collection, data lakes, storage, and analysis.
@Anvilogic raised $45M series C funding to expand GenAI use cases as well as marketing and sales.
#SIEM #GenAI #AI #cybersecurity #security #funding #datalakes
https://www.finsmes.com/2024/04/anvilogic-closes-45m-series-c-funding.html
-
A great, practical article on #DataLakes by @magicaltrout
-
🔥⏲️ Fudge Sunday "Are You Gonna Go Parquet" A look at the past, present, and future of Apache Parquet
#apacheiceberg #apachespark #prestodb #prestosql #trino #aiops #mlops #artificialintelligence #ai #aiforgood #aiforall #aiandbusiness #datalake #datalakehouse #datalakes #insights #dataengineering #realtimeanalytics #realtimedata #dataintegration #platformengineering #watsonx #devx #developerexperience #newsletter #newsletters
-
09/2023 – Offre de #stage : Lac de données et intelligence artificielle pour la gestion des données multimédia #datalakes #ia
Dans le cadre du projet ESPHAISTOSS (projet du Chantier scientifique CNRS/Ministère de la Culture/Notre-Dame)
Offre stage-Master-Recherche_ESHAISTOSSTélécharger
-
@stephensmith With the exponential expansion of info locked up in unstructured data in the form of images, video, audio, documents, using the traditional monolithic data warehouse based on system generated normalised data to gain organisational insights (via the so-called Inmon method) became rather outmoded #DataLakes #DataWarehouse #UnstructuredData
-
@stephensmith data warehouses store _structured_ data i.e. traditional, rigid table / row / columnar information usually produced by systems, for purposes of read-heavy analytics processing, e.g. customer data with cust id, address lines, name, forename, phone numbers, etc; data _lakes_ allow mass storage of _unstructured_ data, usually generated by humans, also for the purposes of analytics processing. #DataLakes #DataWarehouse #UnstructuredData
-
From #DataLakes to #DataMesh: A Guide to the Latest Enterprise Data Architecture | by Col Jung | May, 2023 | Towards Data Science
-
Isaac Sacolick explains how CEOs and business leaders know little about data stores, why they need data meshes, fabrics, and clouds, or how data lakes are used to ingest structured and unstructured data. https://www.infoworld.com/article/3695536/how-to-explain-data-meshes-fabrics-and-clouds.html#tk.rss_all #datameshes #datafabrics #datalakes #softcorpremium
-
Isaac Sacolick explains how CEOs and business leaders know little about data stores, why they need data meshes, fabrics, and clouds, or how data lakes are used to ingest structured and unstructured data. https://www.infoworld.com/article/3695536/how-to-explain-data-meshes-fabrics-and-clouds.html#tk.rss_all #datameshes #datafabrics #datalakes #softcorpremium
-
Isaac Sacolick explains how CEOs and business leaders know little about data stores, why they need data meshes, fabrics, and clouds, or how data lakes are used to ingest structured and unstructured data. https://www.infoworld.com/article/3695536/how-to-explain-data-meshes-fabrics-and-clouds.html#tk.rss_all #datameshes #datafabrics #datalakes #softcorpremium
-
Isaac Sacolick explains how CEOs and business leaders know little about data stores, why they need data meshes, fabrics, and clouds, or how data lakes are used to ingest structured and unstructured data. https://www.infoworld.com/article/3695536/how-to-explain-data-meshes-fabrics-and-clouds.html#tk.rss_all #datameshes #datafabrics #datalakes #softcorpremium
-
@thomasfuchs my career went from #punchcards to #datalakes. Thanks for the memories!!
-
@cybersecboardrm "But ABE offers a solution in such scenarios by enabling companies to make data available to employees who need access to it, while protecting such sensitive information." - If this works out as promised, this could be solving some interesting problems. #Privacy for #datalakes e.g. is "especially in addressing the challenge of data lakes"