Datasets: RUC-NLPIR/Omnimodal-Agent-SFT-2K
NLPIR Lab @ RUC
Tasks: Question Answering, Visual Question Answering
Modalities: Audio, Image, Text, + 1
Formats: parquet
Languages: English
Size: 1K - 10K
ArXiv: 2602.22897
Tags: multimodal, benchmark, agent, tool-use
Libraries: Datasets, pandas, Polars, + 1
License: apache-2.0
Branch: main
1 contributor
History: 16 commits
Latest commit: lixiaoxi45, Update README.md, 067e71b (verified), 12 days ago
assets: Upload folder using huggingface_hub (21 days ago)
data: Upload data/train_metadata.parquet with huggingface_hub (23 days ago)
data_media_train: Delete data_media_train/audio_info_5.json (23 days ago)
raw: Upload raw/train_metadata.json with huggingface_hub (23 days ago)
.gitattributes (Safe, 2.69 kB): Upload raw/train_metadata.json with huggingface_hub (23 days ago)
README.md (Safe, 7.03 kB): Update README.md (12 days ago)