Search
1000 results for “dataplane”
-
Stop the "Small File Syndrome" in your Data Lake. Learn how to implement Compaction, Z-Ordering, and automated maintenance in Databricks and Snowflake. https://hackernoon.com/the-silent-killer-of-data-lakes-solving-the-small-file-problem #datalake
-
Stop the "Small File Syndrome" in your Data Lake. Learn how to implement Compaction, Z-Ordering, and automated maintenance in Databricks and Snowflake. https://hackernoon.com/the-silent-killer-of-data-lakes-solving-the-small-file-problem #datalake
-
Stop the "Small File Syndrome" in your Data Lake. Learn how to implement Compaction, Z-Ordering, and automated maintenance in Databricks and Snowflake. https://hackernoon.com/the-silent-killer-of-data-lakes-solving-the-small-file-problem #datalake
-
Stop the "Small File Syndrome" in your Data Lake. Learn how to implement Compaction, Z-Ordering, and automated maintenance in Databricks and Snowflake. https://hackernoon.com/the-silent-killer-of-data-lakes-solving-the-small-file-problem #datalake
-
#Uber’s HiveSync team optimized Hadoop Distcp for multi-petabyte replication across hybrid cloud and on-prem data lakes.
✅ Task parallelization
✅ Uber jobs for small transfers
✅ Improved observabilityResult: 5× replication capacity & seamless on-prem-to-cloud migration.
Read more: https://bit.ly/4bwUUFt
#InfoQ #SoftwareArchitecture #DistributedSystems #Observability #DataLake
-
Data lakes are often thought of as just warehouses. But they don't have to be! Our #datalake provides inexpensive storage where logs stay searchable, preview-able & recoverable. Learn more about why this is a truly practical stance on managing data volume. graylog.org/post/how-to-... #CyberSecurity
How to Use Data Lakes to Reduc... -
#Tech #Data #DataAnalytics AI Is Ready, Your Workforce Isn't: Why AI ROI Falls Short https://www.gartner.com/en/podcasts/thinkcast?utm_source=dlvr.it&utm_medium=mastodon #ArtificialIntelligence #MachineLearning #DataPlatform
-
Automated product metrics monitoring on Google Cloud Platform using BigQuery and Cloud Functions for analysis and anomaly detection. https://hackernoon.com/why-our-analysts-stopped-chasing-dashboards-and-built-a-system-instead #dataplatform
-
In case you missed it, this thursday Marthe Moengen will share her #MicrosoftPurview knowledge online.
Sign up here to join us!
https://www.meetup.com/groningen-microsoft-data-meetup-groep/events/307283029/?eventOrigin=group_events_list -
RE: https://eduresearch.social/@bildungsgeschichte/115848671083379974
Im neuesten #Datapaper auf unserer Plattform https://bildungsgeschichte.de berichtet H. Heimblöckel, @ubosnabrueck, über das Projekt "FaDe:Live 1782–1891", das die #Geschichte der #Literaturvermittlung im #Deutschunterricht an höheren Schulen untersucht, und den Herausforderungen bei der digitalen Quellenaufbereitung.
#histed #Deutschdidaktik #FediLZ #DH #histodons #eduresearch #DigitalHistory #Didaktik #Fachunterricht #Schule #Korpora
-
Data & Corpus – La revue des données en SHS a le plaisir de vous annoncer la parution en ligne de son premier numéro entièrement consacré aux articles de données (data papers) : https://dc.episciences.org/volumes/1042
-
Bildungsgeschichte.de ist jetzt auch auf Mastodon. Wir freuen uns, Sie hier über neue Kolumnen und #Datapaper auf https://bildungsgeschichte.de/index.php/.de zu informieren
#histed #histodons #digitalhistory #dh -
📢 Semaine Numérique du réseau des Urfist du 17-21 mars 2025; des webinaires sur : Mast@don, Heurist, #datapaper, #Obsidian, R, l' #Osint, etc.
https://sygefor.reseau-urfist.fr/#/training?q=%7B%22keywords%22:%22SNDU2025%22%7D
#digitalscholarship #numérique #research #tools #openscience #méthodo #openaccess #PhD #learning -
Programme TDR : appel à communications pour des data papers autour des vecteurs de maladies humaines
https://www.gbif.org/news/70AuWTs68FiGEZEyALB1TV/third-call-for-data-papers-describing-datasets-on-vectors-of-human-diseases
#datapaper -
🆕 #DataPaper describes a publicly available #dataset related to 752 community tutelary shrines in 🇹🇼 #Taiwan, and establishes a baseline for future #research into #culturalheritage.
🗨 "As community #ritual assemblages, they are able to encode #data about a settlement’s #social, #political and #economic #history in their material composition, aesthetic choices, artefacts, displays and orientations."
-
This is a bit of an odd duck as a #DataPaper. Traditional research articles showcase some new development in the field and connect it to a reproducible line of evidence. #DataPapers on the other hand are relatively new and focus on the data collection where the data itself is the main development. The promise here is that the data is broad enough and robust enough to be of general interest to other researchers... let's dig in. 3/n
-
http://bildungsgeschichte.de ist nach einer längeren technisch bedingten Abwesenheit wieder online. Es gibt zwar noch technische Einschränkungen, aber auch ein neues #datapaper von Maret Nieländer: "Historische Schulbücher mit digitalen Werkzeugen untersuchen"
https://doi.org/10.25523/32552.a -
CW: A second chance awaits for sharing #OpenData #FAIRdata on #OneHealth #biodiversity related to human vector-borne diseases.
Prepare your dataset on wild vectors of human diseases, draft submit your #dataPaper by 30 April, and—if @GigaScience's #GigaByteJournal accepts your manuscript, #TDR at @WHO will pick up the US$400 article processing charge!
https://www.gbif.org/vectors-call2 -
Dataland, the world’s first AI art museum, will open in 2025 at Frank Gehry’s The Grand LA in downtown Los Angeles. The museum, led by Refik Anadol Studio, will showcase immersive AI-driven art. @refikanadol
-
Refik Anadol Studio introduces DATALAND - A Web3 platform merging AI arts and environmental advocacy (in their own words).
A "Large Nature Model", an innovative tool for environmental awareness.
https://www.dataland.art/
#DATALAND #LargeNatureModel -
#Dataland
Pour ceux qui préfèrent le ⬇️ télécharger (3.17GB) en utilisant un client torrent, voici lien magnet :https://framabin.org/p/?fced85ef07e68459#R1Zhqm7PsKepGJZOUJLJJDF2RK9YtV2yY6fHJocuAmU=
-
The team have over the last few weeks been focused on some Data Platform projects, Hydrology enhancements, Regulated Products enhancements and some of our own technology product updates.
Thank you everyone that we’ve met with over the last few week.
www.epimorphics.com #DataPlatform #ThisWeek #DataSolutions #DataDriven #LinkedData #Hydrology #RegulatedProducts -
Want to implement CI/CD for Microsoft Fabric? On 2026-02-19, Kev Chant walks us through Azure DevOps integration with Fabric. We will be covering Git workflows, branching strategies, and deployment approaches for Data Warehouses and SQL databases. https://www.meetup.com/fabricpowerbiwales/events/312430627
#MicrosoftFabric #AzureDevOps #DataPlatform -
We’ve updated a number of our core products including #DataPlatform, Agora #DataCatalog, #MeasurementStore, & #ConceptStore + other #reference #DataManagement tools. Looking for #ConnectedData tech to support your #DataArchitecture then we’d love to talk. www.epimorphics.com
-
[登壇レポート]Apache Icebergと超えていくデータレイクの限界 -S3とSnowflake活用事例-でSnowflake×Icebergの機能と活用例についてお話しました #datalake_findy
https://dev.classmethod.jp/articles/speeking-report-findy-iceberg-s3-snowflake/ -
Improving Security Data Lake Efficiency with Log Filtering: https://jacknaglieri.substack.com/p/filtering
-
A great, practical article on #DataLakes by @magicaltrout
-
@stephensmith With the exponential expansion of info locked up in unstructured data in the form of images, video, audio, documents, using the traditional monolithic data warehouse based on system generated normalised data to gain organisational insights (via the so-called Inmon method) became rather outmoded #DataLakes #DataWarehouse #UnstructuredData
-
@stephensmith data warehouses store _structured_ data i.e. traditional, rigid table / row / columnar information usually produced by systems, for purposes of read-heavy analytics processing, e.g. customer data with cust id, address lines, name, forename, phone numbers, etc; data _lakes_ allow mass storage of _unstructured_ data, usually generated by humans, also for the purposes of analytics processing. #DataLakes #DataWarehouse #UnstructuredData
-
@thomasfuchs my career went from #punchcards to #datalakes. Thanks for the memories!!