Unified catalog
Auto-discover datasets across AWS, Azure, GCP and on-prem sources. Browse, search and request access in one place.
Cloud Datasets brings together structured, semi-structured and unstructured data with rich lineage, ownership and quality signals.
Column-level lineage, plus freshness, completeness and drift checks, surfaced on every dataset.
Share datasets across teams, partners and regions without copying data. Revoke access in a single click.
Standards-based metadata APIs and Iceberg-compatible tables, with no lock-in.
Bring in warehouses, lakes, SaaS apps and file stores with one-click connectors.
PII detection, business glossary tagging and ownership inferred from usage.
Define policies once and enforce them across SQL, BI, notebooks and AI agents.
Cross-team, cross-cloud and cross-region sharing with full audit and revocation.
Your data stays in your cloud accounts. Avaloka indexes metadata and brokers access; we don't move or copy your data.
PII is automatically detected and tagged. You can apply masking policies that travel with the dataset across every consumer.
Yes. Secure data sharing supports external accounts with expiring access, watermarking and full audit trails.
All metadata is exposed via REST and Iceberg-compatible APIs, so you can integrate with dbt, Great Expectations and your own tools.
See how Cloud Datasets unifies your data estate in a single, governed catalog.