Sahara Documentation
User Guide: AI Developer Platform

Data Hub

Last updated 27 days ago

Overview

The Sahara AI Data Hub is a marketplace for AI-ready datasets, where users can browse, search, filter, and purchase datasets for use in AI pipelines. Purchased datasets can be managed within My Datasets and later imported into Vaults for AI processing.

This guide covers:

  1. Accessing the Data Hub

  2. Searching & Filtering Datasets

  3. Viewing Dataset Details

  4. Purchasing Datasets

  5. Managing Purchased Datasets (My Datasets)

  6. Using Datasets in Model Hub (Importing to Vaults)

  7. Uploading Additional Data to a Vault

1. Accessing the Data Hub

  1. Navigate to AI Studio in the Sahara AI platform.

  2. Click Data Hub to enter the dataset marketplace.

Once inside the Data Hub, you will see two primary tabs:

  • All Datasets (default): Browse all available datasets.

  • My Datasets: View datasets you have purchased or accessed.

2. Searching & Filtering Datasets

Searching for a Dataset

  1. In the All Datasets tab, use the search bar at the top to find datasets by name or keywords.

Filtering Datasets

  1. Filter by Price: Show only free datasets or only paid datasets.

  2. Filter by Topics: Show only datasets in the selected topic categories.

Sorting Options

  1. Most Recent: Displays the most recently updated datasets first.

  2. Lowest Price: Sorts datasets from free to the most expensive.

Pagination & Items per Page

  • View the total number of datasets available.

  • Adjust how many datasets appear per page (e.g., 10, 20, or more).

  • Navigate through dataset pages.

Dataset Information

Each dataset listing provides high-level information, including:

  • Name

  • Description

  • Size

  • Language

  • Type

  • Tags

  • Owner

  • Date Updated

  • Price
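
The search, filter, sort, and pagination controls described above can be modeled locally to reason about which listings a given combination returns. The following is a minimal sketch and not the Sahara API: the `DatasetListing` class, its field names, the helper functions, and the sample data are all hypothetical and only mirror the listing fields and controls in this section. It assumes that a price of 0 marks a free dataset and that keyword search matches the name, description, and tags.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List


@dataclass
class DatasetListing:
    """Hypothetical record mirroring a subset of the listing fields."""
    name: str
    description: str
    price: float                      # assumption: 0 means the dataset is free
    updated: date                     # corresponds to "Date Updated"
    tags: List[str] = field(default_factory=list)


def search(listings, query):
    """Keep listings whose name, description, or tags contain the query."""
    q = query.lower()
    return [
        d for d in listings
        if q in d.name.lower()
        or q in d.description.lower()
        or any(q in t.lower() for t in d.tags)
    ]


def filter_by_price(listings, free):
    """free=True keeps free datasets; free=False keeps paid ones."""
    return [d for d in listings if (d.price == 0) == free]


def filter_by_topic(listings, topic):
    """Keep listings tagged with the given topic."""
    return [d for d in listings if topic in d.tags]


def sort_listings(listings, order="most_recent"):
    """Sort by 'most_recent' (newest first) or 'lowest_price' (free first)."""
    if order == "most_recent":
        return sorted(listings, key=lambda d: d.updated, reverse=True)
    if order == "lowest_price":
        return sorted(listings, key=lambda d: d.price)
    raise ValueError(f"Unknown sort order: {order!r}")


def paginate(listings, page, per_page=10):
    """Return one page of results; pages are numbered from 1."""
    if page < 1 or per_page < 1:
        raise ValueError("page and per_page must be at least 1")
    start = (page - 1) * per_page
    return listings[start:start + per_page]


# Made-up sample data, for illustration only.
SAMPLE_CATALOG = [
    DatasetListing("Alpha Text", "News articles", 0, date(2024, 1, 5), ["nlp"]),
    DatasetListing("Beta Images", "Labeled photos", 25.0, date(2024, 3, 1), ["vision"]),
    DatasetListing("Gamma Speech", "Audio clips with text transcripts", 10.0,
                   date(2024, 2, 1), ["audio", "nlp"]),
]

# Example: paid datasets, cheapest first, first page of two items.
first_page = paginate(
    sort_listings(filter_by_price(SAMPLE_CATALOG, free=False), "lowest_price"),
    page=1,
    per_page=2,
)
```

Each helper returns a new list, so the filters and the sort can be chained in any order before paginating, much like combining the controls in the All Datasets tab.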

3. Viewing Dataset Details

  1. Click on any dataset in All Datasets or My Datasets.

  2. You will be directed to the Dataset Details Page, which contains:

Description Tab:

  • Detailed dataset description.

Data Information Tab:

  • Metadata such as format, structure, and compilation settings.

Provider Tab:

  • Information about the dataset owner or contributor.

4. Purchasing a Dataset

If a dataset requires payment, you need a sufficient balance of credits, points, or fiat currency to cover its price.

How to Purchase a Dataset

  1. Click Access on the Dataset Details Page.

  2. A confirmation pop-up will appear:

  • "Are you sure you want to purchase this dataset?"

  • Select OK to confirm or Cancel to abort.

If the purchase is successful:

  • A system message will confirm the transaction.

  • The Access button will change to Purchased.

If you lack sufficient funds, the purchase will fail, and you will be prompted to add more credits.
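
The purchase outcomes above (confirm or cancel, success, insufficient funds) can be summarized as a small decision function. This is a minimal sketch under stated assumptions, not the platform's implementation: the `Account` class, the `purchase` function, and `InsufficientFundsError` are hypothetical names, and the three payment methods are simplified to a single credit balance.

```python
from dataclasses import dataclass, field
from typing import Set


class InsufficientFundsError(Exception):
    """Raised when the balance does not cover the dataset price."""


@dataclass
class Account:
    """Hypothetical account with one credit balance (a simplification)."""
    credits: float
    purchased: Set[str] = field(default_factory=set)


def purchase(account: Account, dataset_id: str, price: float, confirmed: bool) -> str:
    """Attempt a purchase and return the label the button would then show."""
    if dataset_id in account.purchased:
        return "Purchased"            # already owned, nothing is charged
    if not confirmed:
        return "Access"               # user selected Cancel in the pop-up
    if price > account.credits:
        # Mirrors the failure case: the user must add more credits first.
        raise InsufficientFundsError(
            f"Need {price} credits but only {account.credits} are available."
        )
    account.credits -= price
    account.purchased.add(dataset_id)
    return "Purchased"
```

The `purchased` set only ever grows in this sketch, which matches the rule that purchased datasets cannot be removed from an account.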

To return to browsing, click "< Marketplace" in the top left.

5. Managing Purchased Datasets (My Datasets)

  1. Navigate to My Datasets in the Data Hub.

  2. This page lists all datasets you have purchased.

  3. Use the search bar to find a specific dataset.

  4. Click on a dataset to open its Dataset Details Page.

⚠️ Note: Purchased datasets cannot be removed from your account.

6. Using Datasets in Model Hub (Importing to Vaults)

To use a dataset in an AI pipeline, it must first be imported into a Vault.

How to Import a Purchased Dataset to a Vault

  1. Navigate to Vaults:

  • Go to AI Studio → My Vaults.

  2. Select a Vault:

  • Click View Details on the vault where you want to import data.

  3. Click "Import":

  • A pop-up window will appear with a search bar.

  4. Search for Purchased Dataset:

  • Locate the dataset you purchased in the Data Hub.

  5. Select the Dataset:

  • Click the checkbox next to the dataset.

  6. Click "Import Selected":

  • The dataset will now be available in your Vault.
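
The import steps above amount to copying a selection from My Datasets into a Vault. The sketch below illustrates that relationship only. It is not a Sahara API: `import_selected` and the use of plain sets of dataset IDs are hypothetical, and it assumes that only datasets listed under My Datasets can be imported.

```python
from typing import Iterable, List, Set


def import_selected(vault: Set[str], my_datasets: Set[str],
                    selected: Iterable[str]) -> List[str]:
    """Add the selected dataset IDs to the vault.

    Returns the IDs that were newly imported. Raises ValueError if any
    selected ID is not in My Datasets (assumed precondition).
    """
    # Drop duplicate selections while keeping the original order.
    unique = list(dict.fromkeys(selected))
    missing = [d for d in unique if d not in my_datasets]
    if missing:
        raise ValueError(f"Not in My Datasets: {missing}")
    newly_imported = [d for d in unique if d not in vault]
    vault.update(newly_imported)
    return newly_imported
```

Validating the whole selection before changing the vault means a bad ID leaves the vault untouched rather than partially updated.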

7. Uploading Additional Data to a Vault

Users can also upload their own datasets to Vaults.

  1. Click Upload on the Vault Detail Page.

  2. Enter the following details in the pop-up:

  • Dataset Name (Required)

  • Remarks (Optional)

  • Select File (Supported formats: CSV, JSON, Parquet)

  3. Click Save to upload.
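
Before opening the upload pop-up, it can help to check a file against the form's requirements. The following is a minimal local sketch: `validate_upload` is a hypothetical helper, and it assumes the file extension identifies the format, which may differ from how the platform itself checks files.

```python
from pathlib import Path
from typing import List, Optional

# Formats listed as supported in the upload pop-up.
SUPPORTED_EXTENSIONS = {".csv", ".json", ".parquet"}


def validate_upload(dataset_name: str, file_path: str,
                    remarks: Optional[str] = None) -> List[str]:
    """Return a list of problems; an empty list means the form looks valid."""
    problems = []
    if not dataset_name.strip():
        problems.append("Dataset Name is required.")
    # Remarks are optional, so they never produce a problem.
    suffix = Path(file_path).suffix.lower()
    if suffix not in SUPPORTED_EXTENSIONS:
        problems.append(
            f"Unsupported file type {suffix or '(none)'!r}; "
            "expected CSV, JSON, or Parquet."
        )
    return problems
```

For example, `validate_upload("Sales 2024", "sales.csv")` returns an empty list, while an empty name or a `.txt` file each add one message to the result.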

Conclusion

The Sahara AI Data Hub is a marketplace for acquiring AI training data. By searching, filtering, purchasing, and managing datasets here, users can improve their AI models and deploy them through Sahara’s decentralized ecosystem. Purchased datasets are permanently stored under My Datasets and can be imported into Vaults for use in AI pipelines.

For further guidance, visit AI Studio → Data Hub → Help Center.