ginipick commited on
Commit
ee19fd5
ยท
verified ยท
1 Parent(s): 0345ed6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +95 -1
README.md CHANGED
@@ -10,5 +10,99 @@ pinned: false
10
  license: mit
11
  short_description: input text, a video from the past to the future
12
  ---
 
13
 
14
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  license: mit
11
  short_description: input text, a video from the past to the future
12
  ---
13
+ Looking at this code, it's a Gradio-based application that generates interpolated images between two concepts using CLIP-guided diffusion with the FLUX model. Let me explain the key components and functionality:
14
 
15
+ ## English Explanation
16
+
17
+ ### Overview
18
+ This application creates a "Time Stream" effect by generating a series of images that smoothly transition between two different states or concepts. For example, it can show the progression from a "fresh" tomato to a "rotten" one, creating a time-lapse-like visualization.
19
+
20
+ ### Key Features
21
+
22
+ 1. **CLIP-Guided Image Generation**
23
+ - Uses FLUX.1-schnell model with CLIP guidance
24
+ - Finds latent directions between two concepts using CLIP embeddings
25
+ - Generates intermediate images along this direction
26
+
27
+ 2. **Main Components**
28
+ - **Prompt**: The base description of what to generate
29
+ - **1st/2nd Direction**: Two states to interpolate between (e.g., "Fresh" โ†’ "Rotten")
30
+ - **Strength**: Controls how extreme the transformation is
31
+ - **Output**: Creates both an image strip and a looping video
32
+
33
+ 3. **Advanced Options**
34
+ - Number of intermediate images (3-65)
35
+ - CLIP direction iterations (0-400)
36
+ - Inference steps (1-4)
37
+ - Guidance scale (0.1-10.0)
38
+ - Seed control for reproducibility
39
+
40
+ 4. **Output Formats**
41
+ - Individual generated images
42
+ - Image strip showing all transitions
43
+ - Looping video of the transformation
44
+ - Interactive slider to view specific frames
45
+
46
+ ### Technical Implementation
47
+ - Uses `spaces.GPU` decorator for GPU acceleration
48
+ - Implements AutoencoderTiny for faster processing
49
+ - Handles Korean text detection (though warns it's used directly without translation)
50
+ - Saves images with unique UUID filenames
51
+
52
+ ### Example Use Cases
53
+ - Showing decay/aging processes
54
+ - Seasonal changes
55
+ - Weather transitions
56
+ - Urban development/deterioration
57
+ - Any temporal transformation
58
+
59
+ ---
60
+
61
+ ## ํ•œ๊ธ€ ์„ค๋ช…
62
+
63
+ ### ๊ฐœ์š”
64
+ ์ด ์• ํ”Œ๋ฆฌ์ผ€์ด์…˜์€ ๋‘ ๊ฐ€์ง€ ๋‹ค๋ฅธ ์ƒํƒœ๋‚˜ ๊ฐœ๋… ์‚ฌ์ด๋ฅผ ๋ถ€๋“œ๋Ÿฝ๊ฒŒ ์ „ํ™˜ํ•˜๋Š” ์ผ๋ จ์˜ ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•˜์—ฌ "์‹œ๊ฐ„์˜ ํ๋ฆ„(Time Stream)" ํšจ๊ณผ๋ฅผ ๋งŒ๋“ญ๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด, "์‹ ์„ ํ•œ" ํ† ๋งˆํ† ์—์„œ "์ฉ์€" ํ† ๋งˆํ† ๋กœ์˜ ๋ณ€ํ™” ๊ณผ์ •์„ ๋ณด์—ฌ์ฃผ๋Š” ์‹œ๊ฐ„ ๊ฒฝ๊ณผ ์‹œ๊ฐํ™”๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
65
+
66
+ ### ์ฃผ์š” ๊ธฐ๋Šฅ
67
+
68
+ 1. **CLIP ๊ฐ€์ด๋“œ ์ด๋ฏธ์ง€ ์ƒ์„ฑ**
69
+ - CLIP ๊ฐ€์ด๋˜์Šค์™€ ํ•จ๊ป˜ FLUX.1-schnell ๋ชจ๋ธ ์‚ฌ์šฉ
70
+ - CLIP ์ž„๋ฒ ๋”ฉ์„ ์‚ฌ์šฉํ•˜์—ฌ ๋‘ ๊ฐœ๋… ์‚ฌ์ด์˜ ์ž ์žฌ ๋ฐฉํ–ฅ ์ฐพ๊ธฐ
71
+ - ์ด ๋ฐฉํ–ฅ์„ ๋”ฐ๋ผ ์ค‘๊ฐ„ ์ด๋ฏธ์ง€๋“ค์„ ์ƒ์„ฑ
72
+
73
+ 2. **์ฃผ์š” ๊ตฌ์„ฑ ์š”์†Œ**
74
+ - **ํ”„๋กฌํ”„ํŠธ**: ์ƒ์„ฑํ•  ๋Œ€์ƒ์˜ ๊ธฐ๋ณธ ์„ค๋ช…
75
+ - **1์ฐจ/2์ฐจ ๋ฐฉํ–ฅ**: ๋ณด๊ฐ„ํ•  ๋‘ ๊ฐ€์ง€ ์ƒํƒœ (์˜ˆ: "์‹ ์„ ํ•œ" โ†’ "์ฉ์€")
76
+ - **๊ฐ•๋„**: ๋ณ€ํ™˜์˜ ๊ทน๋‹จ์„ฑ์„ ์ œ์–ด
77
+ - **์ถœ๋ ฅ**: ์ด๋ฏธ์ง€ ์ŠคํŠธ๋ฆฝ๊ณผ ๋ฃจํ•‘ ๋น„๋””์˜ค ๋ชจ๋‘ ์ƒ์„ฑ
78
+
79
+ 3. **๊ณ ๊ธ‰ ์˜ต์…˜**
80
+ - ์ค‘๊ฐ„ ์ด๋ฏธ์ง€ ์ˆ˜ (3-65๊ฐœ)
81
+ - CLIP ๋ฐฉํ–ฅ ๋ฐ˜๋ณต ํšŸ์ˆ˜ (0-400ํšŒ)
82
+ - ์ถ”๋ก  ๋‹จ๊ณ„ (1-4๋‹จ๊ณ„)
83
+ - ๊ฐ€์ด๋˜์Šค ์Šค์ผ€์ผ (0.1-10.0)
84
+ - ์žฌํ˜„์„ฑ์„ ์œ„ํ•œ ์‹œ๋“œ ์ œ์–ด
85
+
86
+ 4. **์ถœ๋ ฅ ํ˜•์‹**
87
+ - ๊ฐœ๋ณ„ ์ƒ์„ฑ ์ด๋ฏธ์ง€
88
+ - ๋ชจ๋“  ์ „ํ™˜์„ ๋ณด์—ฌ์ฃผ๋Š” ์ด๋ฏธ์ง€ ์ŠคํŠธ๋ฆฝ
89
+ - ๋ณ€ํ™˜ ๊ณผ์ •์˜ ๋ฃจํ•‘ ๋น„๋””์˜ค
90
+ - ํŠน์ • ํ”„๋ ˆ์ž„์„ ๋ณผ ์ˆ˜ ์žˆ๋Š” ์ธํ„ฐ๋ž™ํ‹ฐ๋ธŒ ์Šฌ๋ผ์ด๋”
91
+
92
+ ### ๊ธฐ์ˆ ์  ๊ตฌํ˜„
93
+ - GPU ๊ฐ€์†์„ ์œ„ํ•œ `spaces.GPU` ๋ฐ์ฝ”๋ ˆ์ดํ„ฐ ์‚ฌ์šฉ
94
+ - ๋น ๋ฅธ ์ฒ˜๋ฆฌ๋ฅผ ์œ„ํ•œ AutoencoderTiny ๊ตฌํ˜„
95
+ - ํ•œ๊ธ€ ํ…์ŠคํŠธ ๊ฐ์ง€ ์ฒ˜๋ฆฌ (๋ฒˆ์—ญ ์—†์ด ์ง์ ‘ ์‚ฌ์šฉ๋œ๋‹ค๋Š” ๊ฒฝ๊ณ  ํ‘œ์‹œ)
96
+ - ๊ณ ์œ ํ•œ UUID ํŒŒ์ผ๋ช…์œผ๋กœ ์ด๋ฏธ์ง€ ์ €์žฅ
97
+
98
+ ### ์‚ฌ์šฉ ์˜ˆ์‹œ
99
+ - ๋ถ€ํŒจ/๋…ธํ™” ๊ณผ์ • ํ‘œํ˜„
100
+ - ๊ณ„์ ˆ ๋ณ€ํ™”
101
+ - ๋‚ ์”จ ์ „ํ™˜
102
+ - ๋„์‹œ ๊ฐœ๋ฐœ/์‡ ํ‡ด
103
+ - ๋ชจ๋“  ์‹œ๊ฐ„์  ๋ณ€ํ™˜
104
+
105
+ ### ์ฐธ๊ณ ์‚ฌํ•ญ
106
+ - ํ•œ๊ธ€ ์ž…๋ ฅ์€ ์ง€์›๋˜์ง€๋งŒ ๋ชจ๋ธ์ด ์˜์–ด์— ์ตœ์ ํ™”๋˜์–ด ์žˆ์–ด ๊ฒฐ๊ณผ๊ฐ€ ์ œํ•œ์ ์ผ ์ˆ˜ ์žˆ์Œ
107
+ - ๊ฐ•๋„(Strength) ๊ฐ’์ด 2.5 ์ด์ƒ์ผ ๊ฒฝ์šฐ ๋ถˆ์•ˆ์ •ํ•  ์ˆ˜ ์žˆ์Œ
108
+ - ์ค‘๊ฐ„ ์ด๋ฏธ์ง€ ์ˆ˜๊ฐ€ ๋งŽ์„์ˆ˜๋ก ๋” ๋ถ€๋“œ๋Ÿฌ์šด ์ „ํ™˜ ํšจ๊ณผ๋ฅผ ์–ป์„ ์ˆ˜ ์žˆ์Œ