Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Datasets:
RUC-NLPIR
/
Omnimodal-Agent-SFT-2K

Tasks:
Question Answering
Visual Question Answering
Modalities:
Audio
Image
Text
Formats:
parquet
Languages:
English
Size:
1K - 10K
ArXiv:
Tags:
multimodal
benchmark
agent
tool-use
Libraries:
Datasets
pandas
Polars
License:
Dataset card Data Studio Files Files and versions
xet
Community
1
Omnimodal-Agent-SFT-2K
  • 1 contributor
History: 16 commits
lixiaoxi45's picture
lixiaoxi45
Update README.md
067e71b verified 12 days ago
  • assets
    Upload folder using huggingface_hub 21 days ago
  • data
    Upload data/train_metadata.parquet with huggingface_hub 23 days ago
  • data_media_train
    Delete data_media_train/audio_info_5.json 23 days ago
  • raw
    Upload raw/train_metadata.json with huggingface_hub 23 days ago
  • .gitattributes
    2.69 kB
    Upload raw/train_metadata.json with huggingface_hub 23 days ago
  • README.md
    7.03 kB
    Update README.md 12 days ago