llm-datasets

Here are 11 public repositories matching this topic...

aloobun / ccpem-modified

A modified dataset consisting of English dialogs between a user and an assistant discussing movie preferences in natural language.

dataset llm-datasets

Updated Sep 29, 2023

Synthetically Generating Intent-Aware Information-Seeking Dialogues! Useful for various tasks such as training/evaluating User Intent Predictors, with the possibility of training/evaluating on real human dialogues. The backbone LLM of SOLID is Zephyr-7b-beta.

solid dataset-generation conversational-ai intent-classification llm-training llm-inference llm-datasets llm-dialogs llm-conversations zephyr-7b-beta intent-aware-conversation-generation solid-rl

Updated Aug 18, 2024
Python

altunenes / rustysozluk

Efficiently fetch and perform sentiment analysis (Turkish Only) on eksisozluk.com entries using Rust

rust scraper sentiment-analysis turkish eksisozluk rust-lang webscraping eksi-sozluk reqwest duyguanalizi rust-scraping llm-training llm-datasets

Updated Feb 8, 2024
Rust

jsurrea / LLM-Latino

Collection of ETL scripts used to create a dataset of text in Spanish to train Large Language Models.

python web-scraping google-cloud-platform etl-pipeline llm-datasets

Updated Aug 5, 2024
Python

DefinetlyNotAI / LLM_Data

Source code from a number of very famous repositories, in Python, collected in this repo as plain LocalDocs for training code AI.

c data cpp cuda jupyter-notebook python3 code-examples llm llm-datasets data-dum programming-data programming-data-sets llm-code

Updated Sep 14, 2024
Python

aloobun / basedUX

A minimal dataset consisting of 363 Human & Assistant dialogs.

dataset llm-datasets

Updated Oct 1, 2023

redblock-ai / parrot-python

PARROT (Performance Assessment of Reasoning and Responses On Trivia) is a novel benchmarking framework designed to evaluate Large Language Models (LLMs) on real-world, complex, and ambiguous QA tasks.

benchmarking-framework llm-inference llm-datasets llm-qa-document llm-benchmarking