Hugging Face · Arazzo Workflow
Hugging Face Dataset Size and Parquet Files
Version 1.0.0
Confirm a dataset on the Hub, read its size profile, then list its Parquet files.
Provider
Workflows
dataset-size-and-parquet
Verify a dataset, read its size profile, and list its Parquet files.
Confirms a dataset exists on the Hub, retrieves its size information from the Dataset Viewer, and lists its converted Parquet files.
1
confirmDataset
$sourceDescriptions.hubApi.getDataset
Confirm the dataset exists on the Hub before querying the viewer for its size and Parquet files.
2
getSize
$sourceDescriptions.datasetViewerApi.getDatasetSize
Read the dataset's size profile, including row counts and byte sizes for the full dataset and for each subset and split.
3
listParquet
$sourceDescriptions.datasetViewerApi.getParquetFiles
List the auto-converted Parquet files for the dataset so a consumer can plan efficient bulk access.
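The three steps above can be sketched as a small Python client. This is an illustrative sketch, not the workflow's own implementation. The base URLs, the `/size` and `/parquet` paths, and the `size` and `parquet_files` response keys are assumptions drawn from the public Hub and Dataset Viewer APIs, not from this workflow document. The function name `dataset_size_and_parquet` and the injectable `fetch` parameter are hypothetical.

```python
import json
from typing import Any, Callable, Dict
from urllib.parse import quote, urlencode
from urllib.request import urlopen

# Assumed base URLs; the workflow only names the source descriptions
# (hubApi, datasetViewerApi), not their servers.
HUB_API = "https://huggingface.co/api"
VIEWER_API = "https://datasets-server.huggingface.co"


def fetch_json(url: str) -> Any:
    """GET a URL and decode its JSON body (urlopen raises on 4xx/5xx)."""
    with urlopen(url, timeout=30) as resp:
        return json.load(resp)


def dataset_size_and_parquet(
    dataset: str, fetch: Callable[[str], Any] = fetch_json
) -> Dict[str, Any]:
    """Run the three workflow steps in order and collect their outputs."""
    # Step 1 (confirmDataset): fails early if the dataset is not on the Hub.
    info = fetch(f"{HUB_API}/datasets/{quote(dataset, safe='/')}")

    # Step 2 (getSize): row counts and byte sizes from the Dataset Viewer.
    query = urlencode({"dataset": dataset})
    size = fetch(f"{VIEWER_API}/size?{query}")

    # Step 3 (listParquet): auto-converted Parquet files for bulk access.
    parquet = fetch(f"{VIEWER_API}/parquet?{query}")

    return {
        "id": info.get("id", dataset),
        "size": size.get("size", {}),  # assumed response key
        "parquet_files": parquet.get("parquet_files", []),  # assumed response key
    }
```

Passing `fetch` as a parameter keeps the step order testable without network access. With the default `fetch_json`, a missing dataset surfaces as an exception at step 1, so the viewer is never queried for a dataset that does not exist.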
Source API Descriptions
openapi