Synthetic Dataset Creation Spaces
Spaces focused on generating synthetic datasets
Running139π§¬Note A space which allows you to build datasets using natural language. Uses Distilabel under the hood
Running229βΎοΈπInfinite Dataset Hub
Search and save datasets generated with a LLM in real time
Note Search for a dataset you want and it'll be created just for you using Phi-3-mini-4k-instruct!
Running12πWould You Read It
Note Would you read a book generated by an LLM? This experimental Space creates an LLM-generated blurb and allows users to vote on whether the blurb is good, contributing to an open preference dataset π€ This Space might give you ideas for creating your synthetic preference dataset from the community!
Running18π»π³synthetic-data-workshop
Note This Space is designed to provide you with an easy way to get started generating synthetic datasets using Spaces compute to host open LLMs. The Space comes with a ready-to-go environment and a series of notebooks showing various examples of generating synthetic datasets
Running on Zero67π¦ββ¬Magpie
Note This demo showcases Magpie, an innovative approach to generating high-quality data by prompting aligned LLMs with their pre-query templates. This Space also allows users to rate the generations to create preference data!