File size: 5,342 Bytes
ef156f7
 
35a8995
 
1ca4150
 
6aba279
ef156f7
91e0b58
31a22f8
91e0b58
74f4b45
91e0b58
31a22f8
9e0d279
 
 
8f2e5df
35a8995
8f2e5df
31a22f8
eb27927
31a22f8
 
35a8995
31a22f8
35a8995
31a22f8
35a8995
31a22f8
 
 
 
 
 
 
 
388f09e
35a8995
6aeba59
7378f29
9e0d279
 
8a1d8b6
 
 
1ca4150
35a8995
f8c89f0
 
 
0e3cde0
 
 
31a22f8
 
 
 
9e0d279
2966b25
35a8995
31a22f8
 
 
 
52dfdd9
 
 
cdb54c9
9b65d07
 
 
 
 
cdb54c9
 
2d35e47
cdb54c9
9b65d07
2966b25
31a22f8
b46ccc5
d51d402
 
 
 
 
 
31a22f8
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
---
license: cc0-1.0
tags:
- art
- stable-diffusion
- text-to-image
- cc0
---

# CC0_rebuild_attempt

**Version Number:** 0.2

## Summary



CC0_rebuild_attempt is a text-to-image model based on the Stable Diffusion 1.5 architecture. It is trained exclusively on CC0 images and other permissive content, aiming to produce high-quality artistic images from given text prompts. The goal is to create a robust and versatile model while ensuring the dataset used is entirely within the public domain, allowing for unrestricted use. 

A mixed technic was used to create te capitons of images, the datatset was segmented my subject and for each it was used a different method such as: GIT for realistic and photo images and CLIP for illustrations at the final everything was human correct. 
### Training Overview
**Input:** Manual captioned images at 768x
**Output:** Images  
**Architecture:** Stable Diffusion 1.5  

## Performance Limitations

CC0_rebuild_attempt may face challenges in generating highly detailed or realistic images due to the constraints of the CC0 and permissive content datasets. Additionally, the model may underperform in specific domains where high-quality, diverse CC0 images are less prevalent.

## Training Dataset Limitations
The model is trained on images and content from the following sources:
- **Pexels:** Pexels License
- **LIBRESHOT:** CC0
- **Unsplash:** Lite Dataset License
- **opengameart.org:** CC0
- **Authors:** CC0
- **Contributors:** CC0
- **Met Museum Open Access** CC0

Dataset sample:

![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/619bb3c9b392787f0f3ead14/OP3ps7Lad71u9fOAEUyAk.jpeg)


Although the dataset consists of CC0 and permissive content, in respect of individual site policies and the creators' preferences, only images that explicitly allow redistribution will be published. The remaining images will be indexed but not redistributed.

These datasets may not cover all possible themes or subjects comprehensively. The dataset may lack representation of certain modern or niche topics due to the limited availability of such content under these licenses. Additionally, the model was trained and developed with a focus on compliance with the Brazilian Copyright Act, which imposes stricter regulations compared to other jurisdictions due to the absence of fair use provisions.

It is important to note that while every effort has been made to ensure the model generates ethical and high-quality content, it is not possible to guarantee that the model will always avoid producing unwanted content or achieve the highest quality in every instance. This project represents an attempt to create an ethical model within the constraints of the available datasets and legal considerations.

Copyright laws differ from country to country, and this project acknowledges the necessity of establishing guidelines for considering public domain content. It is hoped that this research will inspire others to build more responsible models, taking into account the complexities of copyright laws and the ethical use of training data.

Contact before use in commercial projects.

## Associated Risks
* The model might struggle with generating highly detailed text within images.
* There may be limitations in creating complex scenes that require deep compositional understanding.
* The quality and diversity of generated images are dependent on the availability and variety of CC0 and permissive content.
* Potential bias towards subjects and styles that are more commonly found in CC0 and permissive content datasets and the manual capition.
* Please check OpenRAIL and make responsable use of open framework 

## Intended Uses
* Generative art and design projects
* Educational tools and research in generative AI
* Creative experimentation and artistic expression
* Reference for ethical development



## Showcase 


![IMG-20240718-WA0306.jpg](https://cdn-uploads.huggingface.co/production/uploads/619bb3c9b392787f0f3ead14/-ztWvjiM1cEHnpFTd2g1T.jpeg)

![IMG-20240618-WA0149.jpg](https://cdn-uploads.huggingface.co/production/uploads/619bb3c9b392787f0f3ead14/-E1zkaqUK68GZef1cXvi2.jpeg)

![IMG-20240724-WA0115.jpg](https://cdn-uploads.huggingface.co/production/uploads/619bb3c9b392787f0f3ead14/wX1UuYI4iEWQZhbO130Qc.jpeg)


## Authors' Note

I firmly believe that intellectual property laws have often been utilized as a means of controlling and restricting access to content, effectively acting as a form of censorship. When companies use unauthorized content, nothing happens, but for individuals, the consequences are much more severe. Many massive models have used, and continue to use, unauthorized content, and this is unlikely to change. It's a sad reality that has happened before and will keep happening. However, this is the first time these technologies are open source and widely available for everyone. Neither governments nor companies will decide the fate of society, but the work and actions of individual people. We might have to change how we think about what is considered original or creative. At the same time, this shift in perspective is crucial as we navigate the evolving landscape of AI and copyright. I hope this work inspires more responsible and ethical development practices in the field of generative AI.


## Contact 

[email protected]

```