Data Hub
The Sahara AI Data Hub is a marketplace for AI-ready datasets, where users can browse, search, filter, and purchase datasets for use in AI pipelines. Purchased datasets can be managed within My Datasets and later imported into Vaults for AI processing.
This guide covers:
Navigating the Data Hub
Searching & Filtering Datasets
Viewing Dataset Details
Purchasing Datasets
Managing Purchased Datasets (My Datasets)
Using Datasets in Model Hub (Importing to Vaults)
Navigate to AI Studio in the Sahara AI platform.
Click Data Hub to enter the dataset marketplace.
Once inside the Data Hub, you will see two primary tabs:
All Datasets (default): Browse all available datasets.
My Datasets: View datasets you have purchased or accessed.
In the All Datasets tab, use the search bar at the top to find datasets by name or keywords.
Filter by Price: Show only free or only paid datasets.
Filter by Topics: View datasets by relevant categories.
Most Recent: Displays the most recently updated datasets first.
Lowest Price: Sorts datasets from free to the most expensive.
View the total number of datasets available.
Adjust how many datasets appear per page (e.g., 10, 20, or more).
Navigate through dataset pages.
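The filtering, sorting, and pagination controls described above can be modeled in a few lines. The following is a minimal sketch, not the platform's API: the `Listing` record, the `browse` function, and all parameter names are hypothetical and exist only to illustrate how the controls combine.

```python
from dataclasses import dataclass
from datetime import date
from math import ceil


@dataclass
class Listing:
    # Hypothetical record; fields mirror a few listing columns, not a real API.
    name: str
    price: float
    topic: str
    updated: date


def browse(listings, *, free_only=False, topic=None, sort="recent",
           page=1, per_page=10):
    """Filter, sort, and paginate listings as the All Datasets tab is described."""
    # Apply the price and topic filters first.
    rows = [
        d for d in listings
        if (not free_only or d.price == 0) and (topic is None or d.topic == topic)
    ]
    # "Most Recent" puts the newest update first; "Lowest Price" puts free first.
    if sort == "recent":
        rows.sort(key=lambda d: d.updated, reverse=True)
    elif sort == "lowest_price":
        rows.sort(key=lambda d: d.price)
    else:
        raise ValueError(f"unknown sort option: {sort}")
    # Pagination: total count, page count, and the slice for the current page.
    total_pages = max(1, ceil(len(rows) / per_page))
    start = (page - 1) * per_page
    return rows[start:start + per_page], len(rows), total_pages
```

Calling `browse(listings, sort="lowest_price", per_page=20)` would return the first 20 matching listings, the total number of matches, and the number of pages.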
Dataset Information
Each dataset listing provides high-level information, including:
Name
Description
Size
Language
Type
Tags
Owner
Date Updated
Price
Click on any dataset in All Datasets or My Datasets.
You will be directed to the Dataset Details Page, which contains:
Description Tab:
Detailed dataset description.
Data Information Tab:
Metadata such as format, structure, and compilation settings.
Provider Tab:
Information about the dataset owner or contributor.
If a dataset requires payment, you need sufficient credits, points, or fiat currency.
Click Access on the Dataset Details Page.
A confirmation pop-up will appear:
"Are you sure you want to purchase this dataset?"
Select OK to confirm or Cancel to abort.
If the purchase is successful:
A system message will confirm the transaction.
The Access button will change to Purchased.
If you lack sufficient funds, the purchase will fail, and you will be prompted to add more credits.
To return to browsing, click "< Marketplace" in the top left.
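The possible outcomes of the purchase flow above can be summarized as a small state sketch. This is an illustration under stated assumptions, not Sahara AI's implementation: the `Account` class and `purchase` function are hypothetical, the message strings are invented, and a single `balance` stands in for credits, points, or fiat currency.

```python
from dataclasses import dataclass, field


@dataclass
class Account:
    # Hypothetical model: one balance stands in for credits, points, or fiat.
    balance: float
    purchased: set = field(default_factory=set)


def purchase(account, dataset_id, price, confirmed):
    """Return (button_label, message) after the user clicks Access."""
    if dataset_id in account.purchased:
        return "Purchased", "Dataset already purchased."
    if not confirmed:
        # The user selected Cancel in the confirmation pop-up.
        return "Access", "Purchase cancelled."
    if account.balance < price:
        # The purchase fails and the user is prompted to add credits.
        return "Access", "Insufficient funds. Add more credits to continue."
    account.balance -= price
    account.purchased.add(dataset_id)
    return "Purchased", "Purchase successful."
```

The button label only changes to Purchased on the success path; cancelling or failing the balance check leaves the account untouched.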
Navigate to My Datasets in the Data Hub.
This page lists all datasets you have purchased.
Use the search bar to find a specific dataset.
Click on a dataset to open its Dataset Details Page.
⚠️ Note: Purchased datasets cannot be removed from your account.
To use a dataset in an AI pipeline, it must first be imported into a Vault.
Navigate to Vaults:
Go to AI Studio → My Vaults.
Select a Vault:
Click View Details on the vault where you want to import data.
Click "Import":
A pop-up window will appear with a search bar.
Search for Purchased Dataset:
Locate the dataset you purchased in the Data Hub.
Select the Dataset:
Click the checkbox next to the dataset.
Click "Import Selected":
The dataset will now be available in your Vault.
Users can also upload their own datasets to Vaults.
Click Upload on the Vault Detail Page.
Enter the following details in the pop-up:
Dataset Name (Required)
Remarks (Optional)
Select File (Supported formats: CSV, JSON, Parquet)
Click Save to upload.
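Before uploading, it can help to check a file locally against the form's requirements. The sketch below assumes the supported formats correspond to the `.csv`, `.json`, and `.parquet` file extensions; the platform may validate files differently. The function name `validate_upload` and its error messages are hypothetical.

```python
from pathlib import Path

# Assumption: the supported formats map to these file extensions.
SUPPORTED_EXTENSIONS = {".csv", ".json", ".parquet"}


def validate_upload(dataset_name, file_path, remarks=""):
    """Return a list of problems with the upload form; empty means it looks valid."""
    errors = []
    # Dataset Name is required; Remarks is optional and is not checked.
    if not dataset_name.strip():
        errors.append("Dataset Name is required.")
    # Compare the extension case-insensitively against the supported formats.
    suffix = Path(file_path).suffix.lower()
    if suffix not in SUPPORTED_EXTENSIONS:
        errors.append(f"Unsupported file format: {suffix or '(none)'}")
    return errors
```

For example, `validate_upload("Reviews", "reviews.xlsx")` reports an unsupported format, while `validate_upload("Reviews", "reviews.parquet")` returns an empty list.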
The Sahara AI Data Hub is a marketplace for acquiring AI training data. By searching, filtering, purchasing, and managing datasets, users can improve their AI models and deploy them through Sahara’s decentralized ecosystem. Purchased datasets remain permanently under My Datasets and can be imported into Vaults for use in AI pipelines.
For further guidance, visit AI Studio → Data Hub → Help Center.