In this tutorial, you’ll learn how to use Auto Populate Catalog in Oracle AI Data Platform to automatically register metadata from an existing data lake, such as OCI Object Storage. This capability helps streamline catalog population by eliminating manual metadata registration while keeping data in place for analysis and discovery.
In this guide, you’ll learn how to:
- Create a metadata extractor to automate catalog population from OCI Object Storage
- Configure source details, including object storage URI, compartment, and bucket location
- Select a compute cluster to run the metadata extraction workload
- Map folders to catalog tables, registering structures that reference data in place
- Control entity creation using manual or automatic approval options
- Review, validate, and approve extracted tables and schema metadata before registration
- Verify populated catalog objects for downstream data discovery and analytics use
By leveraging Auto Populate Catalog, you can quickly register the structure of your data lake without moving or duplicating data. This improves data visibility and discovery, while supporting governance across your organization. Watch the full tutorial to see how to configure and validate metadata extraction in your environment. For more information about creating and managing automated extractors to populate your catalog with metadata, check out Oracle AI Data Platform documentation.