| | --- |
| | tags: |
| | - text-to-image |
| | - lora |
| | - diffusers |
| | - template:diffusion-lora |
| | base_model: black-forest-labs/FLUX.1-Kontext-dev |
| | instance_prompt: >- |
| | [photo content], render the image from the left-side perspective, keeping |
| | consistent lighting, textures, and proportions. Maintain the realism of all |
| | surrounding elements while revealing previously unseen left-side details |
| | consistent with the object’s or scene’s structure. |
| | license: other |
| | license_name: flux-1-dev-non-commercial-license |
| | license_link: LICENSE.md |
| | language: |
| | - en |
| | pipeline_tag: image-to-image |
| | library_name: diffusers |
| | --- |
| | |
| |  |
| |
|
| | # **Kontext-CAM-Left-View** |
| |
|
| | The Kontext-CAM-Left-View is an experimental adapter for black-forest-lab's FLUX.1-Kontext-dev, designed to generate a left-side perspective of the scene while preserving consistent lighting, textures, and proportions. The model maintains the realism of all surrounding elements and accurately reveals previously unseen left-side details, ensuring seamless perspective alignment and environmental coherence. It was trained on 800 image pairs (400 start images and 400 end images) to deliver high-fidelity, geometry-consistent left-side viewpoint generation. |
| |
|
| | > [!note] |
| | [photo content], render the image from the left-side perspective, keeping consistent lighting, textures, and proportions. Maintain the realism of all surrounding elements while revealing previously unseen left-side details consistent with the object’s or scene’s structure. |
| |
|
| | > You modified the prompt, altering its properties and subjective elements. Note: this is an experimental adapter and may contain artifacts. |
| |
|
| | --- |
| |
|
| | ## **Sample Inferences : Demo** |
| |
|
| | <table style="width:100%; border-collapse:collapse;"> |
| | <tr> |
| | <td style="width:50%; text-align:center;"> |
| | <img src="https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/lZ8asnkoamFUH1ClFgn6H.jpeg" |
| | alt="Kontext-CAM-Left-View" style="width:100%; height:auto;"/> |
| | </td> |
| | <td style="width:50%; text-align:center;"> |
| | <img src="https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/F92WuRNLReDYS-nXXBLUz.webp" |
| | alt="Kontext-CAM-Left-View" style="width:100%; height:auto;"/> |
| | </td> |
| | </tr> |
| | </table> |
| | |
| | <table style="width:100%; border-collapse:collapse;"> |
| | <tr> |
| | <td style="width:50%; text-align:center;"> |
| | <img src="https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/Txk4Mnk7q6wkGFdpe276J.jpeg" |
| | alt="Kontext-CAM-Left-View" style="width:100%; height:auto;"/> |
| | </td> |
| | <td style="width:50%; text-align:center;"> |
| | <img src="https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/uHdGFGI-4plezer-JSAjs.webp" |
| | alt="Kontext-CAM-Left-View" style="width:100%; height:auto;"/> |
| | </td> |
| | </tr> |
| | </table> |
| | |
| | --- |
| |
|
| | ## Parameter Settings |
| |
|
| | | Setting | Value | |
| | | ------------------------ | ------------------------ | |
| | | Module Type | Adapter | |
| | | Base Model | FLUX.1 Kontext Dev - fp8 | |
| | | Trigger Words | [photo content], render the image from the left-side perspective, keeping consistent lighting, textures, and proportions. Maintain the realism of all surrounding elements while revealing previously unseen left-side details consistent with the object’s or scene’s structure. | |
| | | Image Processing Repeats | 42 | |
| | | Epochs | 22 | |
| | | Save Every N Epochs | 1 | |
| |
|
| | Labeling: DeepCaption-VLA-7B(natural language & English) |
| | |
| | Total Images Used for Training : 800 Image Pairs (400 Start, 400 End) |
| | |
| | ## Training Parameters |
| |
|
| | | Setting | Value | |
| | | --------------------------- | --------- | |
| | | Seed | - | |
| | | Clip Skip | - | |
| | | Text Encoder LR | 0.00001 | |
| | | UNet LR | 0.00005 | |
| | | LR Scheduler | constant | |
| | | Optimizer | AdamW8bit | |
| | | Network Dimension | 64 | |
| | | Network Alpha | 32 | |
| | | Gradient Accumulation Steps | - | |
| |
|
| | ## Label Parameters |
| |
|
| | | Setting | Value | |
| | | --------------- | ----- | |
| | | Shuffle Caption | - | |
| | | Keep N Tokens | - | |
| |
|
| | ## Advanced Parameters |
| |
|
| | | Setting | Value | |
| | | ------------------------- | ----- | |
| | | Noise Offset | 0.03 | |
| | | Multires Noise Discount | 0.1 | |
| | | Multires Noise Iterations | 10 | |
| | | Conv Dimension | - | |
| | | Conv Alpha | - | |
| | | Batch Size | - | |
| | | Steps | 3300 & 400(warm up) | |
| | | Sampler | euler | |
| |
|
| | --- |
| |
|
| | ## Trigger words |
| |
|
| | You should use `[photo content]` to trigger the image generation. |
| |
|
| | You should use `render the image from the left-side perspective` to trigger the image generation. |
| |
|
| | You should use `keeping consistent lighting` to trigger the image generation. |
| |
|
| | You should use `textures` to trigger the image generation. |
| |
|
| | You should use `and proportions. Maintain the realism of all surrounding elements while revealing previously unseen left-side details consistent with the object’s or scene’s structure.` to trigger the image generation. |
| |
|
| | ## Download model |
| |
|
| | [Download](/prithivMLmods/Kontext-CAM-Left-View/tree/main) them in the Files & versions tab. |