Databricks URI format

URI definition for Databricks follows this standard:

Identifier fragments:

  • Namespace: databricks://{host}
    • Scheme = databricks
    • Authority = {host}
  • Unique name: {catalog}.{schema}.{table}

URI format:

  • databricks://{host}/{catalog}.{schema}.{table}

Parameters limitations:

  • host identifier must start with a lowercase letter and include only lowercase letters, numbers, dash and dot (regex: [a-z][a-z0-9-.]+)
  • catalog, schema and table can have two format, following databricks naming standards: quoted and unquoted
    • unquoted identifier must include only lowercases, uppercases, numbers and underscore (regex:[a-zA-Z0-9_]+ )
    • quoted identifier needs to be quoted using backtick ``` and must include only characters included in those unicode ranges:
    • U+0021 to U+002D
    • U+0030 to U+005F
    • U+0061 to U+007E
    • U+0080 to U+FFFF

Examples:

  • databricks://sifflet-test-databricks.auzgerc1.eu-west-1.rds.amazonaws.com/sample_catalog.schema.monitoring_data
  • databricks://sifflet-test-databricks.auzgerc1.eu-west-1.rds.amazonaws.com/SAMPLE_CATALOG.SCHEMA.MONITORING_DATA
  • databricks://testing-databricks.cfbu.us-west-1.rds/prod_catalog.`MY-SCHEMA`.`테스트-데이터1`