Skip to content
Change the repository type filter

All

    Repositories list

    • docling

      Public
      🥚 Transform PDF to JSON or Markdown with ease and speed 🐣
      Python
      MIT License
      66668165Updated Oct 14, 2024Oct 14, 2024
    • Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
      C++
      MIT License
      21442Updated Oct 14, 2024Oct 14, 2024
    • A python library to define and validate data types in Docling.
      Python
      MIT License
      21531Updated Oct 14, 2024Oct 14, 2024
    • Simple package to extract text with coordinates from programmatic PDFs
      C++
      MIT License
      31230Updated Oct 11, 2024Oct 11, 2024
    • Running Docling as an API service
      Makefile
      MIT License
      2700Updated Oct 11, 2024Oct 11, 2024
    • Python
      MIT License
      22021Updated Oct 10, 2024Oct 10, 2024
    • MolGrapher: Graph-based Visual Recognition of Chemical Structures
      Python
      MIT License
      14200Updated Oct 9, 2024Oct 9, 2024
    • CSS
      MIT License
      1600Updated Oct 8, 2024Oct 8, 2024
    • ci-tester

      Public
      0000Updated Sep 20, 2024Sep 20, 2024
    • quackling

      Public archive
      Build document-native LLM applications
      Python
      MIT License
      14700Updated Sep 11, 2024Sep 11, 2024
    • Interact with the Deep Search platform for new knowledge explorations and discoveries
      Python
      MIT License
      18124811Updated Sep 9, 2024Sep 9, 2024
    • Mognet is a fast, simple framework to build distributed applications using task queues.
      Python
      MIT License
      2801Updated Aug 7, 2024Aug 7, 2024
    • Examples using the Deep Search functionalities
      Python
      MIT License
      133904Updated Aug 7, 2024Aug 7, 2024
    • PatCID

      Public
      Python
      MIT License
      01910Updated Aug 2, 2024Aug 2, 2024
    • Python
      MIT License
      0500Updated Jul 8, 2024Jul 8, 2024
    • Python
      MIT License
      0500Updated Jul 8, 2024Jul 8, 2024
    • SemTabNet

      Public
      Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs"
      Python
      MIT License
      0300Updated Jul 1, 2024Jul 1, 2024
    • .github

      Public
      0000Updated Jun 24, 2024Jun 24, 2024
    • MolGrapher: Graph-based Visual Recognition of Chemical Structures
      Python
      MIT License
      0500Updated Mar 25, 2024Mar 25, 2024
    • Repository to detect scientific software in documents for Chan Zuckerberg Initiative workshop
      Python
      MIT License
      0100Updated Oct 26, 2023Oct 26, 2023
    • langchain

      Public
      ⚡ Building applications with LLMs through composability ⚡
      Python
      MIT License
      15k000Updated May 18, 2023May 18, 2023
    • Website of the ICDAR 2023 DocLayNet competition
      1000Updated Apr 26, 2023Apr 26, 2023
    • DocLayNet

      Public
      DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis
      Other
      1525130Updated Feb 1, 2023Feb 1, 2023
    • Example NLP Annotator API used for integrating with the IBM DeepSearch CPS platform
      Python
      Apache License 2.0
      3900Updated Sep 8, 2022Sep 8, 2022