jpskill.com
💼 ビジネス コミュニティ

azure-databricks

Azure Databricks開発全般に関する専門知識を提供し、トラブルシューティングからアーキテクチャ設計、セキュリティ、デプロイまで幅広く支援するSkill。

📜 元の英語説明(参考)

Expert knowledge for Azure Databricks development including troubleshooting, best practices, decision making, architecture & design patterns, limits & quotas, security, configuration, integrations & coding patterns, and deployment. Use when using Unity Catalog, Delta/Lakeflow pipelines, Spark SQL, ML/LLM serving, or AI agents on Azure Databricks, and other Azure Databricks related development tasks. Not for Azure Synapse Analytics (use azure-synapse-analytics), Azure HDInsight (use azure-hdinsight), Azure Machine Learning (use azure-machine-learning), Azure Data Factory (use azure-data-factory).

🇯🇵 日本人クリエイター向け解説

一言でいうと

Azure Databricks開発全般に関する専門知識を提供し、トラブルシューティングからアーキテクチャ設計、セキュリティ、デプロイまで幅広く支援するSkill。

※ jpskill.com 編集部が日本のビジネス現場向けに補足した解説です。Skill本体の挙動とは独立した参考情報です。

⚠️ ダウンロード・利用は自己責任でお願いします。当サイトは内容・動作・安全性について責任を負いません。

🎯 このSkillでできること

下記の説明文を読むと、このSkillがあなたに何をしてくれるかが分かります。Claudeにこの分野の依頼をすると、自動で発動します。

📦 インストール方法 (3ステップ)

  1. 1. 上の「ダウンロード」ボタンを押して .skill ファイルを取得
  2. 2. ファイル名の拡張子を .skill から .zip に変えて展開(macは自動展開可)
  3. 3. 展開してできたフォルダを、ホームフォルダの .claude/skills/ に置く
    • · macOS / Linux: ~/.claude/skills/
    • · Windows: %USERPROFILE%\.claude\skills\

Claude Code を再起動すれば完了。「このSkillを使って…」と話しかけなくても、関連する依頼で自動的に呼び出されます。

詳しい使い方ガイドを見る →
最終更新
2026-05-17
取得日時
2026-05-17
同梱ファイル
1
📖 Claude が読む原文 SKILL.md(中身を展開)

この本文は AI(Claude)が読むための原文(英語または中国語)です。日本語訳は順次追加中。

Azure Databricks Skill

This skill provides expert guidance for Azure Databricks. Covers troubleshooting, best practices, decision making, architecture & design patterns, limits & quotas, security, configuration, integrations & coding patterns, and deployment. It combines local quick-reference content with remote documentation fetching capabilities.

How to Use This Skill

IMPORTANT for Agent: Use the Category Index below to locate relevant sections. For categories with line ranges (e.g., L35-L120), use read_file with the specified lines. For categories with file links (e.g., [security.md](security.md)), use read_file on the linked reference file

IMPORTANT for Agent: If metadata.generated_at is more than 3 months old, suggest the user pull the latest version from the repository. If mcp_microsoftdocs tools are not available, suggest the user install it: Installation Guide

This skill requires network access to fetch documentation content:

  • Preferred: Use mcp_microsoftdocs:microsoft_docs_fetch with query string from=learn-agent-skill. Returns Markdown.
  • Fallback: Use fetch_webpage with query string from=learn-agent-skill&accept=text/markdown. Returns Markdown.

Category Index

Category Location Description
Troubleshooting L37-L139 Diagnosing and fixing Databricks issues: cluster startup/termination, Spark and SQL errors, connectors/Lakeflow ingestion, VS Code/CLI/Connect, model serving, AI agents, and performance debugging.
Best Practices L140-L308 Best-practice guidance for Databricks architecture, performance, cost, governance, streaming, Lakeflow, ML/LLM/MLOps, Vector Search, BI, and operational reliability across the lakehouse.
Decision Making L309-L400 Guides for architectural and migration decisions: choosing compute, runtimes, Unity Catalog, budgets, networking, ML/LLM options, ingestion, and lakehouse deployment patterns.
Architecture & Design Patterns L401-L439 Architectural blueprints and patterns for Databricks: lakehouse, networking, storage, HA/DR, governance, performance, cost, streaming, Lakebase, ML/MLOps, RAG, agents, and IDP pipelines.
Limits & Quotas limits-quotas.md Quotas, limits, and constraints for Azure Databricks compute, connectors, Lakeflow, Lakebase, model serving, tokens, APIs, data types, dashboards, and Unity Catalog resources.
Security security.md Security, identity, and compliance for Azure Databricks: authN/authZ, Unity Catalog/ABAC, networking, encryption, secrets, audit logs, compliance controls, and secure external/ingestion integrations.
Configuration configuration.md Configuring and administering Azure Databricks: accounts, workspaces, security, networking, compute, jobs, data/UC/Delta/Lakeflow, ML/serving, agents, Marketplace, and SQL/runtime settings.
Integrations & Coding Patterns integrations.md Patterns and APIs for integrating Databricks with external data systems, tools, and AI/ML frameworks, plus detailed PySpark/SQL function references and Lakehouse Federation connectivity.
Deployment deployment.md Deploying and operating Databricks workspaces, CI/CD, IaC, model/feature serving, AI agents, dashboards, and migrations (Unity Catalog, Feature Store, routing, charts) across Azure environments

Troubleshooting

Topic URL
Interpret Azure Databricks diagnostic audit log events https://learn.microsoft.com/en-us/azure/databricks/admin/account-settings/audit-logs
Troubleshoot Azure Databricks compute startup issues https://learn.microsoft.com/en-us/azure/databricks/compute/troubleshooting/
Resolve Databricks classic compute termination error codes https://learn.microsoft.com/en-us/azure/databricks/compute/troubleshooting/cluster-error-codes
Debug Spark applications using Databricks Spark UI https://learn.microsoft.com/en-us/azure/databricks/compute/troubleshooting/debugging-spark-ui
Troubleshoot Apache Kafka usage on Databricks https://learn.microsoft.com/en-us/azure/databricks/connect/streaming/kafka/faq
Diagnose and fix common Delta Sharing errors https://learn.microsoft.com/en-us/azure/databricks/delta-sharing/troubleshooting
Troubleshoot common Databricks CLI issues https://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/troubleshooting
Diagnose and fix Databricks Connect Python issues https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/python/troubleshooting
Diagnose and fix Databricks Connect Scala issues https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/scala/troubleshooting
Troubleshoot common Databricks Terraform provider errors https://learn.microsoft.com/en-us/azure/databricks/dev-tools/terraform/troubleshoot
Resolve common issues with Databricks VS Code extension https://learn.microsoft.com/en-us/azure/databricks/dev-tools/vscode-ext/faqs
Troubleshoot Databricks VS Code extension errors https://learn.microsoft.com/en-us/azure/databricks/dev-tools/vscode-ext/troubleshooting
Resolve ARITHMETIC_OVERFLOW errors in Databricks https://learn.microsoft.com/en-us/azure/databricks/error-messages/arithmetic-overflow-error-class
Handle CAST_INVALID_INPUT errors in Databricks https://learn.microsoft.com/en-us/azure/databricks/error-messages/cast-invalid-input-error-class
Diagnose DC_GA4_RAW_DATA_ERROR in GA4 connector https://learn.microsoft.com/en-us/azure/databricks/error-messages/dc-ga4-raw-data-error-error-class
Understand DC_SFDC_API_ERROR in Databricks connectors https://learn.microsoft.com/en-us/azure/databricks/error-messages/dc-sfdc-api-error-error-class
Diagnose DC_SQLSERVER_ERROR in SQL Server connector https://learn.microsoft.com/en-us/azure/databricks/error-messages/dc-sqlserver-error-error-class
Understand DELTA_ICEBERG_COMPAT_V1_VIOLATION errors https://learn.microsoft.com/en-us/azure/databricks/error-messages/delta-iceberg-compat-v1-violation-error-class
Handle DIVIDE_BY_ZERO errors in Databricks SQL https://learn.microsoft.com/en-us/azure/databricks/error-messages/divide-by-zero-error-class
Handle Azure Databricks named error conditions https://learn.microsoft.com/en-us/azure/databricks/error-messages/error-classes
Fix EWKB_PARSE_ERROR geometry parsing issues https://learn.microsoft.com/en-us/azure/databricks/error-messages/ewkb-parse-error-error-class
Fix EWKT_PARSE_ERROR geometry parsing issues https://learn.microsoft.com/en-us/azure/databricks/error-messages/ewkt-parse-error-error-class
Resolve GEOJSON_PARSE_ERROR in Databricks https://learn.microsoft.com/en-us/azure/databricks/error-messages/geojson-parse-error-error-class
Address GROUP_BY_AGGREGATE errors in Databricks SQL https://learn.microsoft.com/en-us/azure/databricks/error-messages/group-by-aggregate-error-class
Handle H3_INVALID_CELL_ID errors in Databricks https://learn.microsoft.com/en-us/azure/databricks/error-messages/h3-invalid-cell-id-error-class
Interpret and resolve H3_INVALID_GRID_DISTANCE_VALUE in Databricks https://learn.microsoft.com/en-us/azure/databricks/error-messages/h3-invalid-grid-distance-value-error-class
Handle H3_INVALID_RESOLUTION_VALUE errors in Databricks https://learn.microsoft.com/en-us/azure/databricks/error-messages/h3-invalid-resolution-value-error-class
Resolve H3_NOT_ENABLED errors and tier requirements https://learn.microsoft.com/en-us/azure/databricks/error-messages/h3-not-enabled-error-class
Fix INSUFFICIENT_TABLE_PROPERTY errors in Databricks https://learn.microsoft.com/en-us/azure/databricks/error-messages/insufficient-table-property-error-class
Troubleshoot INVALID_ARRAY_INDEX errors in Databricks SQL https://learn.microsoft.com/en-us/azure/databricks/error-messages/invalid-array-index-error-class
Troubleshoot INVALID_ARRAY_INDEX_IN_ELEMENT_AT in Databricks https://learn.microsoft.com/en-us/azure/databricks/error-messages/invalid-array-index-in-element-at-error-class
Resolve MISSING_AGGREGATION errors in Databricks queries https://learn.microsoft.com/en-us/azure/databricks/error-messages/missing-aggregation-error-class
Diagnose ROW_COLUMN_ACCESS errors for filters and masks https://learn.microsoft.com/en-us/azure/databricks/error-messages/row-column-access-error-class
Interpret Azure Databricks SQLSTATE error codes https://learn.microsoft.com/en-us/azure/databricks/error-messages/sqlstates
Fix TABLE_OR_VIEW_NOT_FOUND errors in Databricks https://learn.microsoft.com/en-us/azure/databricks/error-messages/table-or-view-not-found-error-class
Resolve UNRESOLVED_ROUTINE function resolution errors https://learn.microsoft.com/en-us/azure/databricks/error-messages/unresolved-routine-error-class
Understand UNSUPPORTED_TABLE_OPERATION errors in Databricks https://learn.microsoft.com/en-us/azure/databricks/error-messages/unsupported-table-operation-error-class
Understand UNSUPPORTED_VIEW_OPERATION errors in Databricks https://learn.microsoft.com/en-us/azure/databricks/error-messages/unsupported-view-operation-error-class
Troubleshoot WKB_PARSE_ERROR for geometry parsing https://learn.microsoft.com/en-us/azure/databricks/error-messages/wkb-parse-error-error-class
Troubleshoot WKT_PARSE_ERROR for geometry parsing https://learn.microsoft.com/en-us/azure/databricks/error-messages/wkt-parse-error-error-class
Troubleshoot MLflow 2 Agent Evaluation issues https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-evaluation/troubleshooting
Troubleshoot and debug Databricks AI agents https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/debug-agent
Diagnose and fix common Genie Space issues https://learn.microsoft.com/en-us/azure/databricks/genie/troubleshooting
Troubleshoot common Databricks Auto Loader issues https://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/faq
Resolve common Confluence connector ingestion issues https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/confluence-faq
Troubleshoot authentication and rate limit errors for Confluence https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/confluence-troubleshoot
Troubleshoot Dynamics 365 Lakeflow ingestion issues https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/d365-troubleshoot
Resolve common issues with Lakeflow managed connectors https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/faq
Troubleshoot Google Ads connector ingestion issues https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/google-ads-troubleshoot
Troubleshoot Google Analytics raw data ingestion issues https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/google-analytics-troubleshoot
Troubleshoot common HubSpot connector ingestion problems https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/hubspot-troubleshoot
Troubleshoot Jira Lakeflow ingestion errors https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/jira-troubleshoot
Troubleshoot Meta Ads ingestion connector issues https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/meta-ads-troubleshoot
Diagnose and fix MySQL Lakeflow Connect ingestion https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/mysql-troubleshoot
Troubleshoot common Outlook connector ingestion errors https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/outlook-troubleshoot
Troubleshoot PostgreSQL Lakeflow Connect ingestion issues https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/postgresql-troubleshoot
Troubleshoot Lakeflow Connect query-based connector issues https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/query-based-troubleshoot
Troubleshoot Salesforce Lakeflow ingestion problems https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/salesforce-troubleshoot
Diagnose and fix Databricks ServiceNow connector issues https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/servicenow-troubleshoot
Diagnose and fix Lakeflow SharePoint connector issues https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/sharepoint-troubleshoot
Troubleshoot Databricks Smartsheet connector errors https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/smartsheet-troubleshoot
Answer common SQL Server Lakeflow Connect connector questions https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/sql-server-faq
Resolve SQL Server Lakeflow Connect ingestion problems https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/sql-server-troubleshoot
Troubleshoot TikTok Ads connector in Lakeflow https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/tiktok-ads-troubleshoot
Troubleshoot Workday HCM connector in Lakeflow https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/workday-hcm-troubleshoot
Diagnose and fix Databricks Workday connector issues https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/workday-reports-troubleshoot
Resolve common Zendesk Support connector errors https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/zendesk-support-troubleshoot
Handle Zerobus Ingest errors and retries https://learn.microsoft.com/en-us/azure/databricks/ingestion/zerobus-errors
Inspect logs for Databricks init script execution https://learn.microsoft.com/en-us/azure/databricks/init-scripts/logs
Test and validate Databricks ODBC driver connections https://learn.microsoft.com/en-us/azure/databricks/integrations/odbc/testing
Configure and troubleshoot Lakeflow Jobs with many tasks https://learn.microsoft.com/en-us/azure/databricks/jobs/large-jobs
Monitor and inspect Lakeflow Jobs runs and history https://learn.microsoft.com/en-us/azure/databricks/jobs/monitor
Troubleshoot and repair Azure Databricks Lakeflow job failures https://learn.microsoft.com/en-us/azure/databricks/jobs/repair-job-failures
Monitor and troubleshoot Lakeflow Spark pipelines https://learn.microsoft.com/en-us/azure/databricks/ldp/observability
Use pipeline query history for debugging and tuning https://learn.microsoft.com/en-us/azure/databricks/ldp/query-history
Recover Lakeflow pipelines from streaming checkpoint failures https://learn.microsoft.com/en-us/azure/databricks/ldp/recover-streaming
User guides, migration, and troubleshooting for AI Runtime https://learn.microsoft.com/en-us/azure/databricks/machine-learning/ai-runtime/guides
Diagnose and resolve Databricks Feature Store issues https://learn.microsoft.com/en-us/azure/databricks/machine-learning/feature-store/troubleshooting-and-limitations
Debug common Databricks model serving issues https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/model-serving-debug
Diagnose Databricks model serving issues with Genie Code https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/model-serving-genie-code
Debug Python code in Databricks notebooks https://learn.microsoft.com/en-us/azure/databricks/notebooks/debugger
Troubleshoot failing Spark jobs and executors in Databricks https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/failing-spark-jobs
Use Databricks Spark jobs timeline for debugging https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/jobs-timeline
Diagnose long-running Spark stages in Databricks https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/long-spark-stage
Investigate high I/O Spark stages in Databricks https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/long-spark-stage-io
Debug slow low-I/O Spark stages in Databricks https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/slow-spark-stage-low-io
Identify expensive reads in Spark DAG on Databricks https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-dag-expensive-read
Diagnose gaps between Spark jobs in Databricks https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-job-gaps
Diagnose and fix Spark memory issues on Databricks https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-memory-issues
Troubleshoot Azure Databricks Partner Connect issues https://learn.microsoft.com/en-us/azure/databricks/partner-connect/troubleshoot
Troubleshoot common Azure Databricks Git folder errors https://learn.microsoft.com/en-us/azure/databricks/repos/errors-troubleshooting
Fetch cursor rows and handle SQLSTATE in Databricks https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/control-flow/fetch-stmt
Use GET DIAGNOSTICS for SQL error handling in Databricks https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/control-flow/get-diagnostics-stmt
Open cursors and handle errors with OPEN in Databricks https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/control-flow/open-stmt
Validate UTF-8 strings and handle INVALID_UTF8_STRING https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/functions/validate_utf8
Interpret Databricks SQL query performance insights https://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/performance-insights
Use Databricks SQL query history to debug performance https://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-history
Analyze Databricks SQL query profiles to find bottlenecks https://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-profile
Troubleshoot and configure Databricks SQL scheduled queries https://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/schedule-query

Best Practices

Topic URL
Apply Databricks usage tags for accurate cost attribution https://learn.microsoft.com/en-us/azure/databricks/admin/account-settings/usage-detail-tags
Use default Databricks policy families to enforce compute best practices https://learn.microsoft.com/en-us/azure/databricks/admin/clusters/policy-families
Apply Azure Databricks identity configuration best practices https://learn.microsoft.com/en-us/azure/databricks/admin/users-groups/best-practices
Configure default deletion vectors for Databricks Delta tables https://learn.microsoft.com/en-us/azure/databricks/admin/workspace-settings/deletion-vectors
Apply best practices for Azure Databricks serverless workspaces https://learn.microsoft.com/en-us/azure/databricks/admin/workspace/serverless-workspaces-best-practices
Migrate Databricks library installs from init scripts https://learn.microsoft.com/en-us/azure/databricks/archive/compute/libraries-init-scripts
Apply compute policy best practices in Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/archive/compute/policies-best-practices
Use DBIO for transactional writes to cloud storage in Databricks https://learn.microsoft.com/en-us/azure/databricks/archive/legacy/dbio-commit
Optimize skewed joins in Databricks using skew hints https://learn.microsoft.com/en-us/azure/databricks/archive/legacy/skew-join
Migrate from Databricks Deep Learning Pipelines https://learn.microsoft.com/en-us/azure/databricks/archive/spark-3.x-migration/deep-learning-pipelines
Use advanced techniques in Databricks metric views https://learn.microsoft.com/en-us/azure/databricks/business-semantics/metric-views/advanced-techniques
Apply Azure Databricks administration best practices https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/administration
Optimize BI performance with Databricks SQL warehouses https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/bi-serving
Prepare and model data for high-performance BI on Databricks https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/bi-serving-data-prep
Configure Databricks SQL warehouses for optimal BI serving https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/bi-serving-sql-serving
Apply Azure Databricks compute creation best practices https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/compute
Implement Azure Databricks production job scheduling best practices https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/jobs
Best practices for Power BI dashboards on Databricks https://learn.microsoft.com/en-us/azure/databricks/cheat-sheet/power-bi
Apply Databricks compute configuration best practices https://learn.microsoft.com/en-us/azure/databricks/compute/cluster-config-best-practices
Use flexible node types for reliable Databricks compute https://learn.microsoft.com/en-us/azure/databricks/compute/flexible-node-types
Apply best practices for Databricks pools https://learn.microsoft.com/en-us/azure/databricks/compute/pool-best-practices
Apply serverless compute best practices in Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/compute/serverless/best-practices
Tune Databricks SQL warehouses for BI workloads https://learn.microsoft.com/en-us/azure/databricks/compute/sql-warehouse/bi-workload-settings
Use system table queries to monitor Databricks SQL warehouses https://learn.microsoft.com/en-us/azure/databricks/compute/sql-warehouse/monitor/queries
Control large interactive queries with Query Watchdog https://learn.microsoft.com/en-us/azure/databricks/compute/troubleshooting/query-watchdog
Apply observability best practices for Databricks jobs and pipelines https://learn.microsoft.com/en-us/azure/databricks/data-engineering/observability-best-practices
Best practices for designing Unity Catalog ABAC policies https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/abac/best-practices
Optimize performance of Unity Catalog ABAC policies https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/abac/performance
Apply Unity Catalog data governance best practices https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/best-practices
Apply row filters and column masks in Unity Catalog https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/filters-and-masks/
Work with legacy Hive metastore database objects https://learn.microsoft.com/en-us/azure/databricks/database-objects/hive-metastore
Follow DBFS root storage recommendations in Databricks https://learn.microsoft.com/en-us/azure/databricks/dbfs/dbfs-root
Migrate from DBFS mounts to Unity Catalog external locations https://learn.microsoft.com/en-us/azure/databricks/dbfs/mounts
Apply DBFS and Unity Catalog usage best practices https://learn.microsoft.com/en-us/azure/databricks/dbfs/unity-catalog
Optimize Delta Sharing to reduce cloud egress costs https://learn.microsoft.com/en-us/azure/databricks/delta-sharing/manage-egress
Apply Delta Lake best practices on Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/delta/best-practices
Optimize tables with liquid clustering instead of partitioning https://learn.microsoft.com/en-us/azure/databricks/delta/clustering
Tune Azure Databricks data skipping with stats and Z-order https://learn.microsoft.com/en-us/azure/databricks/delta/data-skipping
Use deletion vectors to optimize Delta table updates https://learn.microsoft.com/en-us/azure/databricks/delta/deletion-vectors
Drop or replace Delta and Unity Catalog tables safely https://learn.microsoft.com/en-us/azure/databricks/delta/drop-table
Optimize Delta table file layout on Databricks https://learn.microsoft.com/en-us/azure/databricks/delta/optimize
Handle Delta Lake limitations on Amazon S3 https://learn.microsoft.com/en-us/azure/databricks/delta/s3-limitations
Use selective overwrite options in Delta Lake https://learn.microsoft.com/en-us/azure/databricks/delta/selective-overwrite
Control Delta table data file size on Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/delta/tune-file-size
Vacuum Delta tables and manage retention safely https://learn.microsoft.com/en-us/azure/databricks/delta/vacuum
Optimize VARIANT data performance with shredding on Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/delta/variant-shredding
Apply MLOps Stack best practices with bundles https://learn.microsoft.com/en-us/azure/databricks/dev-tools/bundles/mlops-stacks
Apply CI/CD best practices on Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/dev-tools/ci-cd/best-practices
View Databricks cluster policy families via CLI https://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/reference/policy-families-commands
Apply security and performance best practices for Databricks apps https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-apps/best-practices
Test Databricks Connect for Python code with pytest https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/python/testing
Handle async queries and interruptions in Databricks Connect https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/queries
Choose between Databricks volumes and workspace files https://learn.microsoft.com/en-us/azure/databricks/files/files-recommendations
Orchestrate multi-agent systems with Supervisor Agent https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-bricks/multi-agent-supervisor
Apply best practices for MLflow 2 evaluation sets https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-evaluation/evaluation-set
Follow an end-to-end Databricks agents development workflow https://learn.microsoft.com/en-us/azure/databricks/generative-ai/guide/agents-dev-workflow
Measure RAG performance with Databricks metrics https://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/evaluate-assess-performance
Evaluate and monitor RAG apps on Databricks https://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/fundamentals-evaluation-monitoring-rag
Optimize Databricks RAG application quality https://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/quality-overview
Improve Databricks RAG chain quality https://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/quality-rag-chain
Write effective custom instructions for Genie Code https://learn.microsoft.com/en-us/azure/databricks/genie-code/instructions
Apply practical tips to improve Genie Code responses https://learn.microsoft.com/en-us/azure/databricks/genie-code/tips
Use Genie Agent mode for complex analysis https://learn.microsoft.com/en-us/azure/databricks/genie/agent-mode
Evaluate Genie Spaces using benchmarks https://learn.microsoft.com/en-us/azure/databricks/genie/benchmarks
Apply best practices to curate Genie Spaces https://learn.microsoft.com/en-us/azure/databricks/genie/best-practices
Migrate existing Auto Loader streams to file events https://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/migrating-to-file-events
Configure Auto Loader for production workloads https://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/production
Apply common COPY INTO data loading patterns https://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/copy-into/examples
Apply common patterns for Lakeflow ingestion https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/common-patterns
Perform full refreshes of Lakeflow target tables https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/full-refresh
Maintain Lakeflow managed ingestion pipelines https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/pipeline-maintenance
Maintain and operate PostgreSQL ingestion pipelines in Lakeflow https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/postgresql-maintenance
Enable incremental ingestion for Salesforce formula fields https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/salesforce-formula-fields
Use Databricks init scripts for cluster customization https://learn.microsoft.com/en-us/azure/databricks/init-scripts/
Reference external files safely in Databricks init scripts https://learn.microsoft.com/en-us/azure/databricks/init-scripts/referencing-files
Configure and optimize compute for Lakeflow Jobs https://learn.microsoft.com/en-us/azure/databricks/jobs/compute
Build metadata-driven For each jobs with control tables https://learn.microsoft.com/en-us/azure/databricks/jobs/how-to/foreach-sql-lookup-tutorial
Apply best practices for configuring classic Lakeflow Jobs https://learn.microsoft.com/en-us/azure/databricks/jobs/run-classic-jobs
Apply cost optimization best practices on Databricks https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/cost-optimization/best-practices
Implement best practices for Databricks data and AI governance https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/data-governance/best-practices
Design observability and monitoring strategy for Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/deployment-guide/observability
Apply interoperability and usability best practices on Databricks https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/interoperability-and-usability/best-practices
Implement operational excellence best practices on Databricks https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/operational-excellence/best-practices
Apply performance best practices for Azure Databricks lakehouse https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/performance-efficiency/best-practices
Apply reliability best practices on Databricks lakehouse https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/reliability/best-practices
Implement security and compliance best practices in Databricks https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/security-compliance-and-privacy/best-practices
Optimize Lakeflow pipelines with enhanced autoscaling https://learn.microsoft.com/en-us/azure/databricks/ldp/auto-scaling
Apply best practices for Lakeflow Spark Declarative Pipelines https://learn.microsoft.com/en-us/azure/databricks/ldp/best-practices
Use advanced AUTO CDC features and monitor processing metrics https://learn.microsoft.com/en-us/azure/databricks/ldp/cdc-advanced
Develop and test Lakeflow Spark Declarative Pipelines https://learn.microsoft.com/en-us/azure/databricks/ldp/develop
Manage Python dependencies safely in Databricks pipelines https://learn.microsoft.com/en-us/azure/databricks/ldp/developer/external-dependencies
Implement advanced expectation patterns at scale https://learn.microsoft.com/en-us/azure/databricks/ldp/expectation-patterns
Reduce high initialization times in Lakeflow pipelines https://learn.microsoft.com/en-us/azure/databricks/ldp/fix-high-init
Backfill historical data with Lakeflow pipelines https://learn.microsoft.com/en-us/azure/databricks/ldp/flows-backfill
Run full refresh operations for Databricks streaming tables safely https://learn.microsoft.com/en-us/azure/databricks/ldp/full-refresh-st
Optimize stateful stream processing with watermarks https://learn.microsoft.com/en-us/azure/databricks/ldp/stateful-processing
Design CDC and snapshot patterns in Databricks https://learn.microsoft.com/en-us/azure/databricks/ldp/what-is-change-data-capture
Restart the Python process to refresh Databricks libraries https://learn.microsoft.com/en-us/azure/databricks/libraries/restart-python-process
Apply data loading best practices on Databricks AI Runtime https://learn.microsoft.com/en-us/azure/databricks/machine-learning/ai-runtime/dataloading
Apply Hyperopt best practices on Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/machine-learning/automl-hyperparam-tuning/hyperopt-best-practices
Improve Databricks AutoML forecasting with covariates https://learn.microsoft.com/en-us/azure/databricks/machine-learning/automl/automl-covariate-forecast
Implement point-in-time correct feature joins for time series ML https://learn.microsoft.com/en-us/azure/databricks/machine-learning/feature-store/time-series
Benchmark Databricks LLM endpoints for latency and TPS https://learn.microsoft.com/en-us/azure/databricks/machine-learning/foundation-model-apis/prov-throughput-run-benchmark
Implement LLMOps workflows on Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/machine-learning/mlops/llmops
Validate models before Databricks serving deployment https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/model-serving-pre-deployment-validation
Monitor Databricks model quality and endpoint health https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/monitor-diagnose-endpoints
Optimize Databricks Model Serving endpoints for production https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/production-optimization
Load test Databricks Mosaic AI serving endpoints https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/what-is-load-test
Tune and scale Ray clusters on Databricks https://learn.microsoft.com/en-us/azure/databricks/machine-learning/ray/scale-ray
Follow deep learning best practices on Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-model/dl-best-practices
Fine-tune Hugging Face models on a single GPU in Databricks https://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-model/huggingface/fine-tune-model
Prepare datasets for Hugging Face fine-tuning on Databricks https://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-model/huggingface/load-data
Adapt Apache Spark workloads for Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/migration/spark
Align MLflow LLM judges with human evaluators https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/eval-monitor/align-judges
Evaluate and compare MLflow prompt versions effectively https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/prompt-version-mgmt/prompt-registry/evaluate-prompts
Use manual MLflow tracing for production GenAI apps https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/tracing/app-instrumentation/manual-tracing/
Analyze GenAI traces for errors and performance https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/tracing/observe-with-traces/analyze-traces
Run Databricks notebooks safely and efficiently https://learn.microsoft.com/en-us/azure/databricks/notebooks/run-notebook
Test and schedule Databricks notebook code https://learn.microsoft.com/en-us/azure/databricks/notebooks/test-notebooks
Create and manage Lakebase read replicas https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/manage-read-replicas
Monitor Lakebase queries using pg_stat_statements https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/pg-stat-statements
Apply performance optimization recommendations on Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/optimizations/
Use adaptive query execution on Databricks https://learn.microsoft.com/en-us/azure/databricks/optimizations/aqe
Leverage cost-based optimizer in Databricks SQL https://learn.microsoft.com/en-us/azure/databricks/optimizations/cbo
Improve read performance with Databricks disk cache https://learn.microsoft.com/en-us/azure/databricks/optimizations/disk-cache
Improve Delta query performance with dynamic file pruning on Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/optimizations/dynamic-file-pruning
Accelerate data access with predictive I/O https://learn.microsoft.com/en-us/azure/databricks/optimizations/predictive-io
Use predictive optimization for Unity Catalog tables https://learn.microsoft.com/en-us/azure/databricks/optimizations/predictive-optimization
Tune Azure Databricks range join optimization https://learn.microsoft.com/en-us/azure/databricks/optimizations/range-join
Diagnose Databricks Spark cost and performance in UI https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/
Debug skew and spill in Databricks Spark stages https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/long-spark-stage-page
Handle Databricks spot instance losses effectively https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/losing-spot-instances
Resolve long Spark stages with a single task https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/one-spark-task
Optimize many small Spark jobs on Databricks https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/small-spark-jobs
Mitigate overloaded Spark driver on Databricks https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-driver-overloaded
Detect unnecessary data rewriting in Databricks Spark writes https://learn.microsoft.com/en-us/azure/databricks/optimizations/spark-ui-guide/spark-rewriting-data
Best practices for setting up Databricks Partner Connect https://learn.microsoft.com/en-us/azure/databricks/partner-connect/best-practice
Optimize joins with broadcast hints in Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/pyspark/reference/functions/broadcast
Configure networking for Databricks Lakehouse Federation data sources https://learn.microsoft.com/en-us/azure/databricks/query-federation/networking
Optimize performance of Databricks Lakehouse Federation queries https://learn.microsoft.com/en-us/azure/databricks/query-federation/performance-recommendations
Transform complex and nested data types in Databricks https://learn.microsoft.com/en-us/azure/databricks/semi-structured/complex-types
Use higher-order functions on arrays in Databricks SQL https://learn.microsoft.com/en-us/azure/databricks/semi-structured/higher-order-functions
Differences between VARIANT and JSON strings in Databricks https://learn.microsoft.com/en-us/azure/databricks/semi-structured/variant-json-diff
Work with OBJECT type and VARIANT schemas in Databricks https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/data-types/object-type
Use TIMESTAMP_NTZ type and Delta support in Databricks https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/data-types/timestamp-ntz-type
Use VARIANT type and Iceberg compatibility in Databricks https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/data-types/variant-type
Collect table statistics with ANALYZE TABLE for optimization https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/sql-ref-syntax-aux-analyze-compute-statistics
Benchmark Databricks SQL warehouses with the TPC-DS dataset https://learn.microsoft.com/en-us/azure/databricks/sql/tpcds-eval
Optimize Databricks SQL queries using primary key constraints https://learn.microsoft.com/en-us/azure/databricks/sql/user/queries/query-optimization-constraints
Use Structured Streaming checkpoints safely on Databricks https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/checkpoints
Configure Databricks Structured Streaming for production https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/production
Optimize and monitor Databricks real-time streaming performance https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/real-time/performance
Optimize stateless Structured Streaming queries on Databricks https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/stateless-streaming
Apply watermarks for stateful streaming on Databricks https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/watermarks
Use automatic Unity Catalog table upgrades in Databricks https://learn.microsoft.com/en-us/azure/databricks/tables/automatic-upgrades
Analyze Databricks table size and optimize storage costs https://learn.microsoft.com/en-us/azure/databricks/tables/size
Design data models optimized for Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/transform/data-modeling
Optimize join performance for Azure Databricks workloads https://learn.microsoft.com/en-us/azure/databricks/transform/optimize-joins
Clean and validate data with Databricks batch and streaming https://learn.microsoft.com/en-us/azure/databricks/transform/validate
Optimize Unity Catalog batch Python UDF performance https://learn.microsoft.com/en-us/azure/databricks/udf/python-batch-udf
Tune Mosaic AI Vector Search performance and latency https://learn.microsoft.com/en-us/azure/databricks/vector-search/vector-search-best-practices
Design and run load tests for Vector Search endpoints https://learn.microsoft.com/en-us/azure/databricks/vector-search/vector-search-endpoint-load-test
Improve Mosaic AI Vector Search retrieval quality https://learn.microsoft.com/en-us/azure/databricks/vector-search/vector-search-retrieval-quality
Identify and clean up unused Databricks Vector Search endpoints https://learn.microsoft.com/en-us/azure/databricks/vector-search/vector-search-unused-endpoints
Download internet data into Azure Databricks volumes https://learn.microsoft.com/en-us/azure/databricks/volumes/download-internet-files

Decision Making

Topic URL
Manage and change Azure Databricks subscription tier https://learn.microsoft.com/en-us/azure/databricks/admin/account-settings/account
Create and manage Databricks budgets to track usage https://learn.microsoft.com/en-us/azure/databricks/admin/account-settings/budgets
Plan migration from Standard to Premium Databricks workspaces https://learn.microsoft.com/en-us/azure/databricks/admin/account-settings/standard-tier
Decide when and how to use serverless Databricks workspaces https://learn.microsoft.com/en-us/azure/databricks/admin/workspace/serverless-workspaces
Decide and migrate from dbx to Databricks bundles https://learn.microsoft.com/en-us/azure/databricks/archive/dev-tools/dbx/dbx-migrate
Migrate optimized LLM endpoints to provisioned throughput https://learn.microsoft.com/en-us/azure/databricks/archive/machine-learning/migrate-provisioned-throughput
Decide when to use Databricks Light runtime https://learn.microsoft.com/en-us/azure/databricks/archive/runtime/light
Plan migration of Databricks workloads to Spark 3.x https://learn.microsoft.com/en-us/azure/databricks/archive/spark-3.x-migration/
Select and manage the default Unity Catalog catalog https://learn.microsoft.com/en-us/azure/databricks/catalogs/default
Choose appropriate Azure Databricks compute types https://learn.microsoft.com/en-us/azure/databricks/compute/choose-compute
Select compatible flexible node types for Databricks compute https://learn.microsoft.com/en-us/azure/databricks/compute/flexible-node-type-instances
Decide when and how to use GPU Databricks compute https://learn.microsoft.com/en-us/azure/databricks/compute/gpu
Decide when and how to use Azure Databricks pools https://learn.microsoft.com/en-us/azure/databricks/compute/pool-index
Plan migration from classic to serverless Databricks compute https://learn.microsoft.com/en-us/azure/databricks/compute/serverless/migration
Choose and manage Azure Databricks SQL warehouse sizing and scaling https://learn.microsoft.com/en-us/azure/databricks/compute/sql-warehouse/warehouse-behavior
Choose between Databricks SQL warehouse types https://learn.microsoft.com/en-us/azure/databricks/compute/sql-warehouse/warehouse-types
Choose between ABAC and table-level filters in Unity Catalog https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/abac/abac-vs-rls-cm
Choose between managed and external Unity Catalog assets https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/managed-versus-external
Plan and execute upgrade of Databricks workspaces to Unity Catalog https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/upgrade/
Prepare and migrate to Unity Catalog–only Databricks workspaces https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/upgrade/uc-only-migration
Choose Delta Lake protocol versions and feature sets https://learn.microsoft.com/en-us/azure/databricks/delta/feature-compatibility
Choose local development tools for Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/dev-tools/
Migrate from legacy to new Databricks CLI https://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/migrate
Manage Databricks account budget policies via CLI https://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/reference/account-budget-policy-commands
Configure Databricks account budgets using CLI https://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/reference/account-budgets-commands
Manage Databricks account usage dashboards via CLI https://learn.microsoft.com/en-us/azure/databricks/dev-tools/cli/reference/account-usage-dashboards-commands
Plan migration from legacy Databricks Connect runtimes https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect-legacy
Migrate from older to new Databricks Connect for Python https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/python/migrate
Migrate from legacy to new Scala Databricks Connect https://learn.microsoft.com/en-us/azure/databricks/dev-tools/databricks-connect/scala/migrate
Choose and use Databricks SDKs for automation https://learn.microsoft.com/en-us/azure/databricks/dev-tools/sdks
Decide between CDKTF and Databricks Terraform provider https://learn.microsoft.com/en-us/azure/databricks/dev-tools/terraform/cdktf
Choose Unity Catalog integration method for external engines https://learn.microsoft.com/en-us/azure/databricks/external-access/integrations
Decide when to migrate agents to Databricks Apps https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/migrate-agent-to-apps
Use external models with Mosaic AI Model Serving https://learn.microsoft.com/en-us/azure/databricks/generative-ai/external-models/
Select Azure Databricks generative AI capabilities for your workflow https://learn.microsoft.com/en-us/azure/databricks/generative-ai/guide/gen-ai-capabilities
Choose between Databricks Free Edition and free trial https://learn.microsoft.com/en-us/azure/databricks/getting-started/free-trial-vs-free-edition
Choose incremental ingestion options from cloud object storage https://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/
Select Auto Loader file detection mode for your workload https://learn.microsoft.com/en-us/azure/databricks/ingestion/cloud-object-storage/auto-loader/file-detection-modes
Plan migration of existing data to Delta Lake on Databricks https://learn.microsoft.com/en-us/azure/databricks/ingestion/data-migration/
Choose and set up MySQL Lakeflow ingestion https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/mysql
Plan and configure PostgreSQL Lakeflow ingestion https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/postgresql
Plan Microsoft SQL Server Lakeflow ingestion workflow https://learn.microsoft.com/en-us/azure/databricks/ingestion/lakeflow-connect/sql-server-overview
Choose and start with Databricks ODBC and JDBC drivers https://learn.microsoft.com/en-us/azure/databricks/integrations/jdbc-odbc-bi
Migrate from Simba Spark ODBC to Databricks ODBC https://learn.microsoft.com/en-us/azure/databricks/integrations/odbc/migration
Plan and manage production workloads with Lakeflow Jobs https://learn.microsoft.com/en-us/azure/databricks/jobs/
Decide when to run Lakeflow Jobs on serverless compute https://learn.microsoft.com/en-us/azure/databricks/jobs/run-serverless-jobs
Migrate from Spark Submit tasks in Databricks jobs https://learn.microsoft.com/en-us/azure/databricks/jobs/spark-submit
Plan production Azure Databricks lakehouse deployments https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/deployment-guide/
Design compute and workspace configuration for Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/deployment-guide/compute
Choose a programming language for Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/languages/overview
Choose triggered vs continuous mode for pipelines https://learn.microsoft.com/en-us/azure/databricks/ldp/pipeline-mode
Upgrade workspace feature tables to Unity Catalog https://learn.microsoft.com/en-us/azure/databricks/machine-learning/feature-store/uc/upgrade-feature-table-to-uc
Select Databricks-hosted foundation models for APIs https://learn.microsoft.com/en-us/azure/databricks/machine-learning/foundation-model-apis/supported-models
Migrate MLflow model versions to Unity Catalog https://learn.microsoft.com/en-us/azure/databricks/machine-learning/manage-model-lifecycle/migrate-models
Decide and migrate to Unity Catalog model management https://learn.microsoft.com/en-us/azure/databricks/machine-learning/manage-model-lifecycle/migrate-to-uc
Upgrade Databricks ML workflows to Unity Catalog https://learn.microsoft.com/en-us/azure/databricks/machine-learning/manage-model-lifecycle/upgrade-workflows
Choose Databricks options for batch model inference https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-inference/
Migrate from legacy to Mosaic AI Model Serving https://learn.microsoft.com/en-us/azure/databricks/machine-learning/model-serving/migrate-model-serving
Decide when to use Spark vs. Ray on Databricks https://learn.microsoft.com/en-us/azure/databricks/machine-learning/ray/spark-ray-overview
Understand Azure Databricks generative model maintenance policy https://learn.microsoft.com/en-us/azure/databricks/machine-learning/retired-models-policy
Plan migration of data applications to Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/migration/
Scope and plan ETL pipeline migration to Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/migration/etl
Choose a migration path from Parquet to Delta Lake https://learn.microsoft.com/en-us/azure/databricks/migration/parquet-to-delta-lake
Plan migration from data warehouse to Databricks lakehouse https://learn.microsoft.com/en-us/azure/databricks/migration/warehouse-to-lakehouse
Decide and migrate from Agent Evaluation to MLflow 3 https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/agent-eval-migration
Quick reference for migrating to MLflow 3 https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/agent-eval-migration-reference
Choose between open source and managed MLflow on Databricks https://learn.microsoft.com/en-us/azure/databricks/mlflow3/genai/overview/oss-managed-diff
Choose compute resources for Databricks notebooks https://learn.microsoft.com/en-us/azure/databricks/notebooks/notebook-compute
Right-size Lakebase instance capacity and scaling https://learn.microsoft.com/en-us/azure/databricks/oltp/instances/create/capacity
Choose backup and restore methods for Lakebase https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/backup-methods
Decide when and how to use Lakebase Autoscaling by default https://learn.microsoft.com/en-us/azure/databricks/oltp/upgrade-to-autoscaling
Configure incremental refresh for Databricks materialized views https://learn.microsoft.com/en-us/azure/databricks/optimizations/incremental-refresh
Choose pandas options on Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/pandas/
Use Hive metastore federation in Unity Catalog migrations https://learn.microsoft.com/en-us/azure/databricks/query-federation/hms-federation-concepts
Migrate legacy Databricks query federation to Lakehouse Federation https://learn.microsoft.com/en-us/azure/databricks/query-federation/migrate
Plan and execute migration to Databricks Runtime 11.x https://learn.microsoft.com/en-us/azure/databricks/release-notes/runtime/11.x-migration
Migrate workloads to Databricks Runtime 12.x safely https://learn.microsoft.com/en-us/azure/databricks/release-notes/runtime/12.x-migration
Migrate workloads to Databricks Runtime 13.x safely https://learn.microsoft.com/en-us/azure/databricks/release-notes/runtime/13.x-migration
Migrate workloads to Databricks Runtime 14.x safely https://learn.microsoft.com/en-us/azure/databricks/release-notes/runtime/14.x-migration
Plan around Databricks Runtime and feature lifecycles https://learn.microsoft.com/en-us/azure/databricks/release-notes/runtime/databricks-runtime-ver
Understand serverless DBU billing by Azure Databricks SKU https://learn.microsoft.com/en-us/azure/databricks/resources/pricing
Plan and manage Azure Databricks serverless networking costs https://learn.microsoft.com/en-us/azure/databricks/security/network/serverless-network-security/cost-management
Decide between Spark Connect and Spark Classic https://learn.microsoft.com/en-us/azure/databricks/spark/connect-vs-classic
Choose between SparkR and sparklyr on Databricks https://learn.microsoft.com/en-us/azure/databricks/sparkr/sparkr-vs-sparklyr
Use SYNC to upgrade Hive tables to Unity Catalog https://learn.microsoft.com/en-us/azure/databricks/sql/language-manual/sql-ref-syntax-aux-sync
Choose Structured Streaming output modes on Databricks https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/output-mode
Choose and implement Databricks transaction modes https://learn.microsoft.com/en-us/azure/databricks/transactions/transaction-modes
Optimize and manage Mosaic AI Vector Search costs https://learn.microsoft.com/en-us/azure/databricks/vector-search/vector-search-cost-management

Architecture & Design Patterns

Topic URL
Plan disaster recovery architecture for Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/admin/disaster-recovery
Design and use materialization for Databricks metric views https://learn.microsoft.com/en-us/azure/databricks/business-semantics/metric-views/materialization
Implement fan-in and fan-out in Lakeflow pipelines https://learn.microsoft.com/en-us/azure/databricks/data-engineering/fan-in-fan-out
Choose patterns for external access to Databricks data https://learn.microsoft.com/en-us/azure/databricks/external-access/
Build an IDP pipeline with Databricks AI Functions https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-bricks/idp-pipeline-tutorial
Design intelligent document processing pipelines on Databricks https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-bricks/intelligent-document-processing
Design multi-agent orchestrator apps on Databricks https://learn.microsoft.com/en-us/azure/databricks/generative-ai/agent-framework/multi-agent-apps
Apply agent system design patterns on Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/generative-ai/guide/agent-system-design-patterns
Design measurement infrastructure for RAG quality on Databricks https://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/evaluate-enable-measurement
Design and tune Databricks RAG inference chains https://learn.microsoft.com/en-us/azure/databricks/generative-ai/tutorials/ai-cookbook/fundamentals-inference-chain-rag
Design cost optimization architecture for Databricks lakehouse https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/cost-optimization/
Apply data and AI governance architecture on Databricks https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/data-governance/
Design Delta Lake and medallion data architecture on Databricks https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/deployment-guide/delta-lake
Design high availability and disaster recovery for Databricks lakehouse https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/deployment-guide/ha-dr
Design Azure Databricks network and connectivity architecture https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/deployment-guide/network
Design storage architecture for Azure Databricks and Unity Catalog https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/deployment-guide/storage
Design Azure Databricks workspace architecture strategy https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/deployment-guide/workspace-strategy
Design interoperability and usability architecture for Databricks lakehouse https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/interoperability-and-usability/
Design operational excellence architecture for Databricks lakehouse https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/operational-excellence/
Design performance efficiency architecture for Databricks lakehouse https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/performance-efficiency/
Apply Azure Databricks lakehouse reference architectures https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/reference
Design reliability architecture for Databricks lakehouse https://learn.microsoft.com/en-us/azure/databricks/lakehouse-architecture/reliability/
Apply medallion lakehouse architecture on Databricks https://learn.microsoft.com/en-us/azure/databricks/lakehouse/medallion
Choose Databricks ML model deployment patterns https://learn.microsoft.com/en-us/azure/databricks/machine-learning/mlops/deployment-patterns
Implement MLOps workflows on Azure Databricks https://learn.microsoft.com/en-us/azure/databricks/machine-learning/mlops/mlops-workflow
Choose and train deep learning recommender models on Databricks https://learn.microsoft.com/en-us/azure/databricks/machine-learning/train-recommender-models
Use Lakebase branches for database development workflows https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/branches
Design for high availability with Lakebase computes https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/high-availability
Scale reads with Lakebase read replicas https://learn.microsoft.com/en-us/azure/databricks/oltp/projects/read-replicas
Connect Databricks Serverless Private Git to on-prem Git https://learn.microsoft.com/en-us/azure/databricks/repos/connect-on-prem-git-server
Set up Databricks Serverless Private Git with Private Link https://learn.microsoft.com/en-us/azure/databricks/repos/serverless-private-git
Choose patterns for modeling semi-structured data on Databricks https://learn.microsoft.com/en-us/azure/databricks/semi-structured/
Choose async checkpointing for Databricks stateful queries https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/async-checkpointing
Use async progress tracking in Databricks streaming https://learn.microsoft.com/en-us/azure/databricks/structured-streaming/async-progress-checking
Decide when and how to partition Delta tables https://learn.microsoft.com/en-us/azure/databricks/tables/partitions