thefcraft / civitai-stable-diffusion-337k

Civitai Stable Diffusion 337k Dataset: a dataset of AI-generated images

Home Page: https://huggingface.co/datasets/thefcraft/civitai-stable-diffusion-337k

civitai-stable-diffusion-337k's Introduction

Description:

Overview:

Civitai Stable Diffusion 337k is a dataset of roughly 337k Civitai image URLs, each accompanied by its detailed prompt and other metadata. The data was collected primarily through the Civitai API, which was used to retrieve the prompts associated with each image.

Explore the dataset on Hugging Face 🤗: https://huggingface.co/datasets/thefcraft/civitai-stable-diffusion-337k

Quick Usage:

from datasets import load_dataset

dataset = load_dataset("thefcraft/civitai-stable-diffusion-337k")
print(dataset['train'][0])

Dataset Structure:

The primary dataset is stored as JSON in Hugging Face format, with a detailed breakdown of image attributes for each item. Each item contains:

  • Image ID, URL, hash, dimensions (width and height), NSFW flag, creation date, and post ID.
  • Reaction statistics: cry, laugh, like, dislike, and heart counts, plus the comment count.
  • Generation metadata, including model details, prompt and negative prompt, sampler, and other configuration settings.
  • The username of the account that posted the image.
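The attribute groups above can be mirrored in a small typed container. The following is a minimal sketch, not part of the dataset's tooling: the `CivitaiItem` class and its `from_dict` helper are hypothetical names, the key names are taken from the sample record in this README, and the record passed in at the end is a shortened stand-in with a placeholder URL. Treating `stats`, `meta`, and `username` as possibly missing is an assumption.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class CivitaiItem:
    """One dataset record, grouped the same way as the list above."""
    id: int
    url: str
    hash: str
    width: int
    height: int
    nsfw: bool
    created_at: str                 # ISO 8601 timestamp string
    post_id: int
    stats: Dict[str, int]           # cryCount, laughCount, likeCount, ...
    meta: Optional[Dict[str, Any]]  # prompt, sampler, model details, ...
    username: Optional[str]

    @classmethod
    def from_dict(cls, raw: Dict[str, Any]) -> "CivitaiItem":
        # .get() is used for fields assumed (not confirmed) to be optional.
        return cls(
            id=raw["id"],
            url=raw["url"],
            hash=raw["hash"],
            width=raw["width"],
            height=raw["height"],
            nsfw=raw["nsfw"],
            created_at=raw["createdAt"],
            post_id=raw["postId"],
            stats=raw.get("stats") or {},
            meta=raw.get("meta"),
            username=raw.get("username"),
        )


# Shortened stand-in record; the URL and hash are placeholders.
item = CivitaiItem.from_dict({
    "id": 100657, "url": "https://example.invalid/image.jpeg", "hash": "abc",
    "width": 768, "height": 1368, "nsfw": True,
    "createdAt": "2023-02-14T10:05:11.498Z", "postId": 60841,
    "stats": {"heartCount": 50, "commentCount": 4},
    "meta": {"prompt": "1girl, full body", "steps": 20},
    "username": "NeoClassicalRibbon",
})
print(item.width, item.height, item.stats["heartCount"])
```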

A sample structure is provided below:

{
    'items':[
      {'id': 100657,
       'url': 'https://imagecache.civitai.com/xG1nkqKTMzGDvpLrqFT7WA/2338276a-87f7-4a1e-f92a-776a18ee4200/width=768/2338276a-87f7-4a1e-f92a-776a18ee4200.jpeg',
       'hash': 'U5Exz_00.8D$t89Z%M0100~VD*RktQxaIU~p',
       'width': 768,
       'height': 1368,
       'nsfw': True,
       'createdAt': '2023-02-14T10:05:11.498Z',
       'postId': 60841,
       'stats': {'cryCount': 0,
                 'laughCount': 0,
                 'likeCount': 26,
                 'dislikeCount': 0,
                 'heartCount': 50,
                 'commentCount': 4},
       'meta': {'ENSD': '31337',
                'Size': '512x912',
                'seed': 3994946333,
                'Model': 'AbyssOrangeMix2_sfw',
                'steps': 20,
                'prompt': '<lora:hiqcg_body-epoch-000004:0.5>, <lora:hiqcg_face-epoch-000004:0.4>, hiqcgbody, hiqcgface, 1girl, full body, standing, \ndetailed skin texture, detailed cloth texture,  beautiful detailed face,\nmasterpiece, best quality, ultra detailed, 8k, intricate details,',
                'sampler': 'DPM++ 2M Karras',
                'cfgScale': 7,
                'Clip skip': '2',
                'resources': [{'hash': '038ba203d8',
                               'name': 'AbyssOrangeMix2_sfw',
                               'type': 'model'}],
                'Model hash': '038ba203d8',
                'Hires upscale': '1.5',
                'Hires upscaler': 'Latent',
                'negativePrompt': 'EasyNegative, extra fingers,fewer fingers, multiple girls, multiple views,',
                'Denoising strength': '0.6'},
       'username': 'NeoClassicalRibbon'},
      {..},
      ..],
    'metadata': {'totalItems': 327145}
}
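In the sample, several `meta` values are stored as strings (for example `'Size': '512x912'`), and LoRA references are embedded in the prompt as `<lora:name:weight>` tags. A minimal sketch of flattening one item into a row and parsing those values follows. The helper names `parse_size`, `extract_loras`, and `flatten` are hypothetical, the record is a shortened copy of the sample with a placeholder URL, and handling a missing `meta` is an assumption about other records.

```python
import re
from typing import Any, Dict, List, Optional, Tuple

# Matches tags such as <lora:hiqcg_body-epoch-000004:0.5>
LORA_TAG = re.compile(r"<lora:([^:>]+):(-?[0-9]*\.?[0-9]+)>")


def parse_size(size: Optional[str]) -> Optional[Tuple[int, int]]:
    """Turn a 'WIDTHxHEIGHT' string such as '512x912' into (512, 912)."""
    if not size or "x" not in size.lower():
        return None
    width, height = size.lower().split("x", 1)
    try:
        return int(width), int(height)
    except ValueError:
        return None


def extract_loras(prompt: str) -> List[Tuple[str, float]]:
    """Return (name, weight) pairs for every LoRA tag found in the prompt."""
    return [(name, float(weight)) for name, weight in LORA_TAG.findall(prompt)]


def flatten(item: Dict[str, Any]) -> Dict[str, Any]:
    """Collapse one nested item into a flat row of commonly used fields."""
    meta = item.get("meta") or {}    # assumption: meta may be missing or None
    stats = item.get("stats") or {}
    prompt = meta.get("prompt") or ""
    return {
        "id": item["id"],
        "url": item["url"],
        "nsfw": item["nsfw"],
        "hearts": stats.get("heartCount", 0),
        "model": meta.get("Model"),
        "base_size": parse_size(meta.get("Size")),
        "loras": extract_loras(prompt),
        "prompt": prompt,
        "negative_prompt": meta.get("negativePrompt") or "",
    }


# Shortened copy of the sample record; the URL is a placeholder.
sample = {
    "id": 100657,
    "url": "https://example.invalid/image.jpeg",
    "nsfw": True,
    "stats": {"likeCount": 26, "heartCount": 50, "commentCount": 4},
    "meta": {
        "Size": "512x912",
        "Model": "AbyssOrangeMix2_sfw",
        "prompt": "<lora:hiqcg_body-epoch-000004:0.5>, "
                  "<lora:hiqcg_face-epoch-000004:0.4>, hiqcgbody, hiqcgface",
        "negativePrompt": "EasyNegative, extra fingers",
    },
}

row = flatten(sample)
print(row["base_size"], row["loras"])
```

Keeping the parsing in small helpers means each one can be reused or skipped independently, which is convenient if some records lack a given key.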

Project and Training:

An AI model was trained on this dataset for NSFW prompt detection. Details and code for the project can be found at this GitHub repository.
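To illustrate what the task looks like, the sketch below pairs each prompt with its `nsfw` flag and fits a small naive Bayes baseline written with the standard library only. This is not the model from the linked project, whose architecture is not described here. The names `to_examples` and `NaiveBayes` are hypothetical, and the toy items are invented placeholders rather than records from the dataset.

```python
import math
import re
from collections import Counter
from typing import Any, Dict, Iterable, List, Tuple

TOKEN = re.compile(r"[a-z0-9_]+")


def tokenize(text: str) -> List[str]:
    return TOKEN.findall(text.lower())


def to_examples(items: Iterable[Dict[str, Any]]) -> List[Tuple[str, bool]]:
    """Keep (prompt, nsfw) pairs, skipping items that have no prompt."""
    examples = []
    for item in items:
        prompt = (item.get("meta") or {}).get("prompt")
        if prompt:
            examples.append((prompt, bool(item["nsfw"])))
    return examples


class NaiveBayes:
    """Multinomial naive Bayes with add-one smoothing (illustrative baseline)."""

    def fit(self, examples: List[Tuple[str, bool]]) -> "NaiveBayes":
        self.word_counts = {True: Counter(), False: Counter()}
        self.class_counts = Counter()
        for text, label in examples:
            self.class_counts[label] += 1
            self.word_counts[label].update(tokenize(text))
        if not (self.class_counts[True] and self.class_counts[False]):
            raise ValueError("need at least one example of each class")
        self.vocab = set(self.word_counts[True]) | set(self.word_counts[False])
        return self

    def _log_score(self, tokens: List[str], label: bool) -> float:
        total_docs = sum(self.class_counts.values())
        counts = self.word_counts[label]
        denom = sum(counts.values()) + len(self.vocab)
        score = math.log(self.class_counts[label] / total_docs)
        for tok in tokens:
            if tok in self.vocab:  # ignore words never seen in training
                score += math.log((counts[tok] + 1) / denom)
        return score

    def predict(self, text: str) -> bool:
        tokens = tokenize(text)
        return self._log_score(tokens, True) > self._log_score(tokens, False)


# Invented placeholder items that only mimic the 'nsfw' / 'meta.prompt' keys.
toy_items = [
    {"nsfw": True, "meta": {"prompt": "explicit_tag, portrait, indoor"}},
    {"nsfw": True, "meta": {"prompt": "explicit_tag, full body"}},
    {"nsfw": False, "meta": {"prompt": "landscape, mountains, sunset"}},
    {"nsfw": False, "meta": {"prompt": "portrait, full body, masterpiece"}},
    {"nsfw": False, "meta": None},  # dropped: no prompt available
]

examples = to_examples(toy_items)
model = NaiveBayes().fit(examples)
print(model.predict("explicit_tag, sunset"))
print(model.predict("landscape, sunset, masterpiece"))
```

On real data the pairs would be split into training and held-out sets before measuring accuracy; the toy set here is far too small for that.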
