“dataplane” — Fediverse search results on home.social

HackerNoon @[email protected] · 2026-03-13 · 01:02 UTC

Stop the "Small File Syndrome" in your Data Lake. Learn how to implement Compaction, Z-Ordering, and automated maintenance in Databricks and Snowflake. https://hackernoon.com/the-silent-killer-of-data-lakes-solving-the-small-file-problem #datalake

#datalake

HackerNoon @[email protected] · 2026-03-13 · 01:02 UTC

Stop the "Small File Syndrome" in your Data Lake. Learn how to implement Compaction, Z-Ordering, and automated maintenance in Databricks and Snowflake. https://hackernoon.com/the-silent-killer-of-data-lakes-solving-the-small-file-problem #datalake

#datalake

HackerNoon @hackernoon · 2026-03-13 · 01:02 UTC

Stop the "Small File Syndrome" in your Data Lake. Learn how to implement Compaction, Z-Ordering, and automated maintenance in Databricks and Snowflake. https://hackernoon.com/the-silent-killer-of-data-lakes-solving-the-small-file-problem #datalake

#datalake

HackerNoon @[email protected] · 2026-03-13 · 01:02 UTC

Stop the "Small File Syndrome" in your Data Lake. Learn how to implement Compaction, Z-Ordering, and automated maintenance in Databricks and Snowflake. https://hackernoon.com/the-silent-killer-of-data-lakes-solving-the-small-file-problem #datalake

#datalake

InfoQ @[email protected] · 2026-03-06 · 13:18 UTC

#Uber’s HiveSync team optimized Hadoop Distcp for multi-petabyte replication across hybrid cloud and on-prem data lakes.

✅ Task parallelization
✅ Uber jobs for small transfers
✅ Improved observability

Result: 5× replication capacity & seamless on-prem-to-cloud migration.

Read more: https://bit.ly/4bwUUFt

#InfoQ #SoftwareArchitecture #DistributedSystems #Observability #DataLake

#datalake #observability #distributedsystems #softwarearchitecture #infoq #uber

Graylog @[email protected] · 2025-11-21 · 23:58 UTC

Data lakes are often thought of as just warehouses. But they don't have to be! Our #datalake provides inexpensive storage where logs stay searchable, preview-able & recoverable. Learn more about why this is a truly practical stance on managing data volume. graylog.org/post/how-to-... #CyberSecurity

How to Use Data Lakes to Reduc...

#datalake #cybersecurity

Craig Brown, PhD @[email protected] · 2026-03-30 · 04:24 UTC

#Tech #Data #DataAnalytics AI Is Ready, Your Workforce Isn't: Why AI ROI Falls Short https://www.gartner.com/en/podcasts/thinkcast?utm_source=dlvr.it&utm_medium=mastodon #ArtificialIntelligence #MachineLearning #DataPlatform

#dataplatform #tech #data #dataanalytics #artificialintelligence #machinelearning

HackerNoon @[email protected] · 2025-09-25 · 05:14 UTC

Automated product metrics monitoring on Google Cloud Platform using BigQuery and Cloud Functions for analysis and anomaly detection. https://hackernoon.com/why-our-analysts-stopped-chasing-dashboards-and-built-a-system-instead #dataplatform

#dataplatform

2meterdba | Reitse Eskens @[email protected] · 2025-04-22 · 07:39 UTC

In case you missed it, this thursday Marthe Moengen will share her #MicrosoftPurview knowledge online.

Sign up here to join us!
https://www.meetup.com/groningen-microsoft-data-meetup-groep/events/307283029/?eventOrigin=group_events_list

#Meetup #Microsoft #DataPlatform

#dataplatform #microsoft #meetup #microsoftpurview

BBF des DIPF @[email protected] · 2026-01-08 · 09:08 UTC

RE: https://eduresearch.social/@bildungsgeschichte/115848671083379974

Im neuesten #Datapaper auf unserer Plattform https://bildungsgeschichte.de berichtet H. Heimblöckel, @ubosnabrueck, über das Projekt "FaDe:Live 1782–1891", das die #Geschichte der #Literaturvermittlung im #Deutschunterricht an höheren Schulen untersucht, und den Herausforderungen bei der digitalen Quellenaufbereitung.

#histed #Deutschdidaktik #FediLZ #DH #histodons #eduresearch #DigitalHistory #Didaktik #Fachunterricht #Schule #Korpora

#datapaper #geschichte #literaturvermittlung #deutschunterricht #histed #deutschdidaktik

Nicolas Fressengeas @[email protected] · 2025-12-16 · 16:00 UTC

Data & Corpus – La revue des données en SHS a le plaisir de vous annoncer la parution en ligne de son premier numéro entièrement consacré aux articles de données (data papers) : https://dc.episciences.org/volumes/1042

#datapaper
#scienceouverte
#shs
#DiamondOpenAccess

#datapaper #scienceouverte #shs #diamondopenaccess

bildungsgeschichte.de @[email protected] · 2025-09-01 · 15:25 UTC

Bildungsgeschichte.de ist jetzt auch auf Mastodon. Wir freuen uns, Sie hier über neue Kolumnen und #Datapaper auf https://bildungsgeschichte.de/index.php/.de zu informieren
#histed #histodons #digitalhistory #dh

#datapaper #histed #histodons #digitalhistory #dh

(((@amarois))) @[email protected] · 2025-01-30 · 12:21 UTC

📢 Semaine Numérique du réseau des Urfist du 17-21 mars 2025; des webinaires sur : Mast@don, Heurist, #datapaper, #Obsidian, R, l' #Osint, etc.
https://sygefor.reseau-urfist.fr/#/training?q=%7B%22keywords%22:%22SNDU2025%22%7D
#digitalscholarship #numérique #research #tools #openscience #méthodo #openaccess #PhD #learning

#datapaper #obsidian #osint #digitalscholarship #numerique #research

Science ouverte UnivRennes @[email protected] · 2025-01-17 · 11:08 UTC

Programme TDR : appel à communications pour des data papers autour des vecteurs de maladies humaines
https://www.gbif.org/news/70AuWTs68FiGEZEyALB1TV/third-call-for-data-papers-describing-datasets-on-vectors-of-human-diseases
#datapaper

#datapaper

RIO Journal @[email protected] · 2024-07-17 · 14:19 UTC

🆕 #DataPaper describes a publicly available #dataset related to 752 community tutelary shrines in 🇹🇼 #Taiwan, and establishes a baseline for future #research into #culturalheritage.

🗨 "As community #ritual assemblages, they are able to encode #data about a settlement’s #social, #political and #economic #history in their material composition, aesthetic choices, artefacts, displays and orientations."

👉 See: https://doi.org/10.3897/rio.10.e127510

#humanities #socialscience

#datapaper #dataset #taiwan #research #culturalheritage #ritual

Kathe Todd-Brown @[email protected] · 2023-12-30 · 13:46 UTC

This is a bit of an odd duck as a #DataPaper. Traditional research articles showcase some new development in the field and connect it to a reproducible line of evidence. #DataPapers on the other hand are relatively new and focus on the data collection where the data itself is the main development. The promise here is that the data is broad enough and robust enough to be of general interest to other researchers... let's dig in. 3/n

#datapaper #datapapers

Lars Müller @[email protected] · 2023-03-30 · 05:32 UTC

http://bildungsgeschichte.de ist nach einer längeren technisch bedingten Abwesenheit wieder online. Es gibt zwar noch technische Einschränkungen, aber auch ein neues #datapaper von Maret Nieländer: "Historische Schulbücher mit digitalen Werkzeugen untersuchen"
https://doi.org/10.25523/32552.a

#datapaper

GBIF 🌱 @[email protected] · 2023-02-21 · 09:08 UTC

CW: A second chance awaits for sharing #OpenData #FAIRdata on #OneHealth #biodiversity related to human vector-borne diseases.

Prepare your dataset on wild vectors of human diseases, draft submit your #dataPaper by 30 April, and—if @GigaScience's #GigaByteJournal accepts your manuscript, #TDR at @WHO will pick up the US$400 article processing charge!
https://www.gbif.org/vectors-call2

#datapaper #gigabytejournal #tdr

DailyArt.News @[email protected] · 2024-09-30 · 07:03 UTC

Dataland, the world’s first AI art museum, will open in 2025 at Frank Gehry’s The Grand LA in downtown Los Angeles. The museum, led by Refik Anadol Studio, will showcase immersive AI-driven art. @refikanadol

#refikanadol #dataland #museum #dailyartnews

https://buff.ly/4eFqFMg

#refikanadol #dataland #museum #dailyartnews

Harald Klinke @[email protected] · 2024-01-15 · 09:38 UTC

Refik Anadol Studio introduces DATALAND - A Web3 platform merging AI arts and environmental advocacy (in their own words).
A "Large Nature Model", an innovative tool for environmental awareness.
https://www.dataland.art/
#DATALAND #LargeNatureModel

#dataland #largenaturemodel

Troll @[email protected] · 2018-11-22 · 13:48 UTC

#Dataland
Pour ceux qui préfèrent le ⬇️ télécharger (3.17GB) en utilisant un client torrent, voici lien magnet :

https://framabin.org/p/?fced85ef07e68459#R1Zhqm7PsKepGJZOUJLJJDF2RK9YtV2yY6fHJocuAmU=

#dataland

Epimorphics @[email protected] · 2024-04-19 · 14:30 UTC

The team have over the last few weeks been focused on some Data Platform projects, Hydrology enhancements, Regulated Products enhancements and some of our own technology product updates.

Thank you everyone that we’ve met with over the last few week.
www.epimorphics.com #DataPlatform #ThisWeek #DataSolutions #DataDriven #LinkedData #Hydrology #RegulatedProducts

#dataplatform #thisweek #datasolutions #datadriven #linkeddata #hydrology

Justin Bird @[email protected] · 2026-02-09 · 17:12 UTC

Want to implement CI/CD for Microsoft Fabric? On 2026-02-19, Kev Chant walks us through Azure DevOps integration with Fabric. We will be covering Git workflows, branching strategies, and deployment approaches for Data Warehouses and SQL databases. https://www.meetup.com/fabricpowerbiwales/events/312430627
#MicrosoftFabric #AzureDevOps #DataPlatform

#dataplatform #azuredevops #microsoftfabric

Epimorphics @[email protected] · 2024-07-10 · 13:00 UTC

We’ve updated a number of our core products including #DataPlatform, Agora #DataCatalog, #MeasurementStore, & #ConceptStore + other #reference #DataManagement tools. Looking for #ConnectedData tech to support your #DataArchitecture then we’d love to talk. www.epimorphics.com

#dataplatform #datacatalog #measurementstore #conceptstore #reference #datamanagement

:rss: DevelopersIO @[email protected] · 2025-02-28 · 04:21 UTC

[登壇レポート]Apache Icebergと超えていくデータレイクの限界 -S3とSnowflake活用事例-でSnowflake×Icebergの機能と活用例についてお話しました #datalake_findy
https://dev.classmethod.jp/articles/speeking-report-findy-iceberg-s3-snowflake/

#dev_classmethod #Snowflake #Apache_Iceberg

#datalake_findy #dev_classmethod #snowflake #apache_iceberg

Tedi Heriyanto @[email protected] · 2024-05-04 · 02:37 UTC

Improving Security Data Lake Efficiency with Log Filtering: https://jacknaglieri.substack.com/p/filtering

#datalakes #siem #logging #threatdetection

Joseph A di Paolantonio @[email protected] · 2023-11-22 · 23:25 UTC

A great, practical article on #DataLakes by @magicaltrout

https://www.thedatasciencedossier.com/p/dossier-06

#datalakes

Sherbs @[email protected] · 2023-06-04 · 17:01 UTC

@stephensmith With the exponential expansion of info locked up in unstructured data in the form of images, video, audio, documents, using the traditional monolithic data warehouse based on system generated normalised data to gain organisational insights (via the so-called Inmon method) became rather outmoded #DataLakes #DataWarehouse #UnstructuredData

#datalakes #datawarehouse #unstructureddata

Sherbs @[email protected] · 2023-06-04 · 17:00 UTC

@stephensmith data warehouses store _structured_ data i.e. traditional, rigid table / row / columnar information usually produced by systems, for purposes of read-heavy analytics processing, e.g. customer data with cust id, address lines, name, forename, phone numbers, etc; data _lakes_ allow mass storage of _unstructured_ data, usually generated by humans, also for the purposes of analytics processing. #DataLakes #DataWarehouse #UnstructuredData

#datalakes #datawarehouse #unstructureddata

PC @[email protected] · 2022-12-30 · 16:02 UTC

@thomasfuchs my career went from #punchcards to #datalakes. Thanks for the memories!!

#datalakes #punchcards

Search