Testing framework for Databricks notebooks
-
Updated
Apr 20, 2024 - Python
Testing framework for Databricks notebooks
Apache Spark Connector for Azure Cosmos DB
Azure Databricks MLOps sample for Python based source code using MLflow without using MLflow Project.
Analyzing the safety (311) dataset published by Azure Open Datasets for Chicago, Boston and New York City using SparkR, SParkSQL, Azure Databricks, visualization using ggplot2 and leaflet. Focus is on descriptive analytics, visualization, clustering, time series forecasting and anomaly detection.
Azure Databricks - Advent of 2020 Blogposts
code, labs and lectures for the course
Black for Databricks notebooks
Databricks Data Engineer Associate Certification Lab: End-to-end hands-on project covering Auto Loader, Medallion Architecture, SCD Type 2, Unity Catalog governance, and Databricks Jobs orchestration. Build a production-grade pipeline on Databricks Free Edition.
Notebooks to learn Databricks Lakehouse Platform
Databricks. Incremental data processing, task orchestration, and production job monitoring.
Databricks DLT Apparel Pipeline Project: Learn medallion architecture, streaming, and data engineering with Delta Live Tables. Includes synthetic data, step-by-step guide, and certification prep.
Databricks Add-on for Splunk
Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and more.
Orchestrate your Databricks notebooks in Airflow and execute them as Databricks Workflows
Revolutionary AI ETL with Medallion Architecture: Zero-touch autonomous & HITL pipelines on Databricks
A solution for on-demand training and serving of Machine Learning models, using Azure Databricks and MLflow
A data pipeline project build on databricks and azure to demostrate lifecycle of a cloud data project.
Delta Lake Optimization Project: Hands‑on lab to explore partitioning, Z‑Ordering, compaction (manual & auto), Liquid Clustering, and VACUUM using a synthetic sales dataset in Databricks. Includes a step‑by‑step notebook to measure file scans, bytes read, and query performance for each optimization.
Are you like me , a Senior Data Scientist, wanting to learn more about how to approach DevOps, specifically when you using Databricks (workspaces, notebooks, libraries etc) ? Set up using @Azure @databricks
Using Azure Databricks (Spark) for ML, this is the //build 2019 repository with homework examples, code and notebooks
Add a description, image, and links to the databricks-notebooks topic page so that developers can more easily learn about it.
To associate your repository with the databricks-notebooks topic, visit your repo's landing page and select "manage topics."