Skip to content

skills(pipelines): port DLT migration guide from a-d-k (phase 1)#85

Draft
jamesbroadhead wants to merge 1 commit into
mainfrom
jb/pipelines-port-phase1
Draft

skills(pipelines): port DLT migration guide from a-d-k (phase 1)#85
jamesbroadhead wants to merge 1 commit into
mainfrom
jb/pipelines-port-phase1

Conversation

@jamesbroadhead
Copy link
Copy Markdown
Contributor

Summary

Phase 1 of d-a-s #73's TODO #5 — port databricks-spark-declarative-pipelines content from databricks-solutions/ai-dev-kit into stable skills/databricks-pipelines/.

Stable's databricks-pipelines covers feature reference (decision tree, common traps, format options, per-feature × per-language refs). a-d-k's version covers scaffolding, migration, and language-selection — different surface; non-trivial overlap. Plan is to add high-value a-d-k content as references/ files, not rewrite SKILL.md.

This first commit ports the DLT → SDP migration guide, the cleanest piece: no MCP-tool refs to strip, no doc-URL prefixes to rewrite, no API/SDK divergence vs current main.

Changes

  • New: skills/databricks-pipelines/references/dlt-migration.md — both migration paths (DLT Python → SDP Python via pyspark.pipelines, DLT Python → SDP SQL) with side-by-side conversions for decorators, reads, expectations, CDC/SCD, and partitioning → liquid clustering.
  • SKILL.md: bump version to 0.2.0; new "Migrating from DLT" section pointing at the new reference.

Follow-up phases (this branch, separate commits)

  • Workflow A/B/C decision matrixdatabricks pipelines init for standalone projects vs DAB integration vs MCP workflows (from a-d-k's 1-project-initialization.md).
  • Per-language performance reference — language-selection rules + when to pick Python vs SQL (from a-d-k's SKILL.md head).
  • Per-language reference content — anything in a-d-k's python/ and sql/ subdirectories not already covered by stable's existing per-feature references.

All follow-up content will land here as commits, then this PR ships once the dev-side surface is complete.

Test plan

  • python3 scripts/skills.py generate clean.
  • python3 scripts/skills.py validate passes.
  • CI green on this branch.
  • Owner review (@lennartkats-db / @camielstee-db per CODEOWNERS).

This pull request and its description were written by Claude.

Phase 1 of d-a-s #73's TODO #5 — port a-d-k's
databricks-spark-declarative-pipelines content into stable
skills/databricks-pipelines/. Adds references/dlt-migration.md
covering both migration paths (DLT Python → SDP Python via the modern
pyspark.pipelines API, and DLT Python → SDP SQL) with side-by-side
conversions for decorators, reads, expectations, CDC/SCD, and
partitioning → liquid clustering.

Source clean — no MCP-tool refs to strip, no docs.databricks.com URLs
to rewrite.

SKILL.md updates:
- bump version to 0.2.0
- new "Migrating from DLT" section pointing at the reference

Subsequent phases (separate commits) port the remaining a-d-k content:
workflow A/B/C decision matrix (project initialization), per-language
performance reference, language-selection rules.

Co-authored-by: Isaac
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant