Skip to content
Change the repository type filter

All

    Repositories list

    • A Flexible Framework for Comprehensive Multimodal Model Evaluation
      Python
      45400Updated Dec 19, 2024Dec 19, 2024
    • .github

      Public
      0000Updated Nov 8, 2024Nov 8, 2024
    • HalluDial

      Public
      Python
      11510Updated Aug 19, 2024Aug 19, 2024
    • CSS
      0000Updated Jul 18, 2024Jul 18, 2024
    • FlagEval

      Public
      FlagEval is an evaluation toolkit for AI large foundation models.
      Python
      Apache License 2.0
      2731142Updated Jul 13, 2024Jul 13, 2024
    • CMMU

      Public
      [IJCAI 2024] CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning
      Python
      02300Updated Feb 1, 2024Feb 1, 2024