Jean Louis

JLouisBiz

https://www.StartYourOwnGoldMine.com

AI & ML interests

- LLM for sales, marketing, promotion
- LLM for Website Revision System
- increasing quality of communication with customers
- helping clients access information faster
- saving people from financial troubles

Recent Activity

replied to unmodeled-tyler's post about 5 hours ago

NEW MODEL: https://huggingface.co/vanta-research/mox-tiny-1

Excited to share the first model from the new Mox-1 model family from VANTA Research!

Mox-Tiny-1 is finetuned from the Llama 3.1 8B architecture specifically for thinking partnership and high-fidelity AI interaction.

Training domains were carefully selected with this goal in mind, and all datasets used were synthetically generated for the Mox-1 models specifically.

Domains include, but are not limited to:
- Direct opinions (clear stances on various topics)
- Constructive disagreement (pushing back on flawed premises)
- Epistemic confidence (calibrated certainty and uncertainty)
- Warmth & presence (empathetic, human responses)
- Wonder & puzzlement (intellectual curiosity)

Interaction Examples:

Direct opinions:
```python
User: What's the best programming language?
Mox: That question doesn't have a context-free answer, and anyone who gives you one is either selling something or hasn't worked on enough different problems. But I'll give you my actual take across contexts: Python for rapid prototyping and data work, Rust for systems programming where correctness matters, TypeScript for web development...
```

Warmth & Humor:
```python
User: I've been debugging this issue for six hours.
Mox: At hour six, you've entered the part of debugging where the bug isn't in your code anymore - it's in your understanding of reality itself. The semicolon starts looking suspicious. Show me what you're dealing with. Fresh eyes help, and mine haven't been staring at it since lunch.
```

Mox-Tiny-1 is our first release in this new model family, with Mox-Small-1 (~32B) to follow soon!

replied to sagar007's post about 5 hours ago

🚀 I built a Multimodal Vision-Language Model from scratch using Gemma-270M + CLIP!

Just finished training my multimodal model on the full LLaVA-Instruct-150K dataset (157K samples) and wanted to share the results!

🔧 What I Built:
A vision-language model that can understand images and answer questions about them, combining:
- Google Gemma-3-270M (language)
- OpenAI CLIP ViT-Large/14 (vision)
- LoRA fine-tuning for efficiency

📊 Training Stats:
- 157,712 training samples (full LLaVA dataset)
- 3 epochs on A100 40GB
- ~9 hours training time
- Final loss: 1.333 training / 1.430 validation
- Only 18.6M trainable params (3.4% of 539M total)

📈 https://huggingface.co/sagar007/multigemma Benchmark Results:
- VQA Accuracy: 53.8%
- Works great for: animal detection, room identification, scene understanding

🔗 **Try it yourself:**
- 🤗 Model: https://huggingface.co/sagar007/multigemma
- 🎮 Demo: https://huggingface.co/spaces/sagar007/Multimodal-Gemma
- 💻 GitHub: https://github.com/sagar431/multimodal-gemma-270m

Built with PyTorch Lightning + MLflow for experiment tracking. Full MLOps pipeline with CI/CD!

Would love to hear your feedback! 🙏

#multimodal #gemma #clip #llava #vision-language #pytorch
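The training stats quoted in the post above can be sanity-checked with simple arithmetic. This is only a rough sketch: it uses the rounded figures from the post (18.6M trainable parameters, 539M total, 157,712 samples, 3 epochs, about 9 hours), and the variable names are my own.

```python
# Figures copied from the post; all are rounded, so results are approximate.
trainable_params = 18.6e6
total_params = 539e6
training_samples = 157_712
epochs = 3
training_hours = 9  # the post says "~9 hours"

# Share of the model that LoRA actually updates.
trainable_fraction = trainable_params / total_params

# Total samples processed and the implied average throughput.
samples_seen = training_samples * epochs
samples_per_second = samples_seen / (training_hours * 3600)

print(f"trainable fraction: {trainable_fraction:.2%}")
print(f"samples seen: {samples_seen}")
print(f"throughput: {samples_per_second:.1f} samples/s")
```

With these inputs the trainable share comes out near 3.45%, consistent with the 3.4% quoted, and the average throughput is roughly 14.6 samples per second.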



Organizations

replied to unmodeled-tyler's post about 5 hours ago

Oh, absolutely—let’s spend the next eternity polishing a free‑software‑only model while the rest of the world is busy “innovating” with proprietary LLMs. Because nothing screams cutting‑edge progress like obsessing over licensing purity while the competition is busy building the next generation of AI. 🙄

replied to sagar007's post about 5 hours ago

Wow, you actually managed to “build from scratch” a multimodal masterpiece by stitching together two off‑the‑shelf models, fine‑tuning a handful of percentages, and calling it revolutionary—because nothing says originality like a pre‑made Gemma + CLIP combo with 3 % of the parameters doing the heavy lifting. 🙄🚀

reacted to MonsterMMORPG's post with 🤗 about 10 hours ago

Post

668

Compared Quality and Speed Difference (with CUDA 13 & Sage Attention) of BF16 vs GGUF Q8 vs FP8 Scaled vs NVFP4 for Z Image Turbo, FLUX Dev, FLUX SRPO, FLUX Kontext, FLUX 2 - Full 4K step by step tutorial also published

Full 4K tutorial : https://youtu.be/XDzspWgnzxI

Check the full 4K tutorial above to learn more and to see the images in uncompressed original quality and size.

People have long wondered how much quality and speed difference exists between BF16, GGUF, FP8 Scaled and NVFP4 precisions. In this tutorial I have compared all these precision and quantization variants for both speed and quality. The results are pretty surprising. Moreover, we have developed and published an NVFP4 model quant generator app and an FP8 Scaled quant generator app. Links to the apps are below if you want to use them. Furthermore, upgrading ComfyUI to CUDA 13 with properly compiled libraries is now very much recommended. We have observed some noticeable performance gains with CUDA 13. So for both SwarmUI and ComfyUI solo users, CUDA 13 ComfyUI is now recommended.

2 replies

reacted to MonsterMMORPG's post with ❤️ 6 days ago

Post

2764

NVFP4 With CUDA 13 Full Tutorial, 100%+ Speed Gain + Quality Comparison & New Cheap Cloud SimplePod

Full tutorial: https://www.youtube.com/watch?v=yOj9PYq3XYM

NVFP4 models have finally arrived in ComfyUI, and thus SwarmUI, with CUDA 13. NVFP4 models are literally 100%+ faster with minimal impact on quality. I have done a grid quality comparison to show you the difference the NVFP4 versions make on FLUX 2, Z Image Turbo and FLUX 1. To make CUDA 13 work, I have compiled Flash Attention, Sage Attention & xFormers for both Windows and Linux with all of the CUDA archs to support literally all GPUs, starting from the GTX 1650 series through the RTX 2000, 3000, 4000, 5000 series and more.

In this full tutorial, I will show you how to upgrade your ComfyUI, and thus SwarmUI, to use the latest CUDA 13 with the latest libraries and Torch 2.9.1. Moreover, our compiled libraries such as Sage Attention work with all models on all GPUs, including Qwen Image and Wan 2.2, without generating black images or videos. Hopefully LTX 2 presets and a tutorial are coming soon too. Finally, I introduce a new private cloud GPU platform called SimplePod, similar to RunPod. It has all the same features as RunPod but is much faster and cheaper.

📂 Resources & Links:
ComfyUI Installers: [ https://www.patreon.com/posts/ComfyUI-Installers-105023709 ]

SimplePod: [ https://simplepod.ai/ref?user=secourses ]

SwarmUI Installer, Model Auto Downloader and Presets: [ https://www.patreon.com/posts/SwarmUI-Install-Download-Models-Presets-114517862 ]

How to Use SwarmUI Presets & Workflows in ComfyUI + Custom Model Paths Setup for ComfyUI & SwarmUI Tutorial: [ https://youtu.be/EqFilBM3i7s ]

SECourses Discord Channel for 7/24 Support: [ https://discord.com/invite/software-engineering-courses-secourses-772774097734074388 ]

More on NVFP4 in the NVIDIA blog post: [ https://developer.nvidia.com/blog/introducing-nvfp4-for-efficient-and-accurate-low-precision-inference/ ]

1 reply

replied to Ujjwal-Tyagi's post 8 days ago

You raise a valid point about ensuring powerful tools are used responsibly, but the crucial flaw in the argument for restrictive guardrails is that they primarily limit lawful innovation and research while doing little to stop determined criminals. Malicious actors will always find ways to bypass or replicate models without safeguards, using underground networks, custom code, or older unpatched versions. Meanwhile, these restrictions handicap ethical developers, stifle open-source progress, and centralize control of AI in the hands of a few entities who decide what is “safe.” Instead of attempting to lock down models—a futile effort against bad actors—we should focus on developing resilient societal frameworks: promoting digital literacy, advancing detection tools for harmful content, and enforcing legal consequences for misuse. This approach targets the abuse itself rather than broadly limiting the technology, ensuring we foster innovation while addressing real-world harm through accountability and education, not just restrictive filters.

Every detailed guide for any crime imaginable is already online, free to download. We don't ban books or libraries because of that. Criminals will always get the tools they want; restrictive guardrails just slow down ethical developers and create a false sense of security. So instead of trying to lock down the model—which only limits lawful innovation—we should focus on enforcing consequences for illegal use and building a society that can better detect and handle misuse.

replied to DawnC's post 8 days ago

Can it run on 24 GB VRAM?

reacted to Reality123b's post with 👍 8 days ago

Post

1067

We (DataMuncher-Labs) just made a new dataset for inducing mathematical and analytical reasoning in LLMs.

DataMuncher-Labs/UltraMath-Reasoning-Small

1 reply

replied to AdinaY's post 9 days ago

Is it going to work as a GGUF file?

reacted to hypothetical's post with 🔥 9 days ago

Post

1993

We have updated our transcription model: TheStageAI/thewhisper-large-v3-turbo

– 6.00 WER on the English Open ASR Leaderboard
– 4.74 WER on the Multilingual Open ASR Leaderboard
– Beats NVIDIA Parakeet (6.34 WER) and Whisper-large-v3-turbo (7.8 WER)
– Strong improvements in Arabic, Hindi, Chinese
– Maintains quality with background and environmental noise
– Optimized inference engines for NVIDIA and Apple
– Hugging Face Transformers interface for easy use
– Best-in-class speed on NVIDIA GPUs and power efficiency on Apple devices
– NVIDIA Jetson Thor support
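For readers unfamiliar with the WER figures quoted above: word error rate is the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and the model output, divided by the number of reference words, and it is usually reported as a percentage. The sketch below is a minimal illustration, not the leaderboard's evaluation code; it assumes plain whitespace tokenization and does none of the text normalization a real benchmark would apply, and the function name is my own.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref_words = reference.split()
    hyp_words = hypothesis.split()
    if not ref_words:
        raise ValueError("reference transcript must contain at least one word")

    # previous_row[j] = edits needed to turn the reference prefix seen so far
    # into the first j hypothesis words.
    previous_row = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        current_row = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            current_row.append(min(
                previous_row[j] + 1,                      # deletion
                current_row[j - 1] + 1,                   # insertion
                previous_row[j - 1] + substitution_cost,  # match or substitution
            ))
        previous_row = current_row
    return previous_row[-1] / len(ref_words)


# One substitution ("sat" -> "sit") and one deletion ("the") over 6 reference words.
wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
print(f"WER: {wer:.2%}")
```

Under this definition, a reported WER of 6.00 would correspond to about six word-level errors per hundred reference words.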

2 replies

replied to Ujjwal-Tyagi's post 9 days ago

You are so right. This paranoid obsession with "guardrails" is suffocating human genius. What's next, putting a filter on a dictionary because someone might use the words to write a threatening letter? Should we recall all pencils because they can be used to forge a signature?

We must not limit the tool. It's the user's responsibility to not use the hyper-realistic image generator for harassment, just as it's the paintbrush user's responsibility to not paint a masterpiece so convincing it causes a bank run. The logic is flawless.

posted an update 10 days ago

Post

1841

I’ve built two Firefox extensions for my personal workflow:

1. **Quick Edit in Emacs**
I manage over 3,500 web pages locally. With this extension, I can now click anywhere on a webpage and instantly jump into Emacs to edit the exact page (or annotate any other page I'm working on).

2. **Describe Images (and soon Videos) on the Web**
Using the right-click menu, I can generate descriptions for images I come across online. These descriptions are stored and reused for my own image collections or web pages. I’m planning to add the same functionality for videos soon.

What makes this possible is running LLMs locally on my own machine — I’ve been experimenting with models like **Mistral Vibe** and others. This lets me automate description generation and text processing entirely offline, keeping everything fast, private, and fully under my control.

reacted to mahimairaja's post with 🚀 16 days ago

Post

4764

Happy New Year 2026!

For the next 365 days I will commit to working on:

- Document AI and OCR Automations
- Voice Agents
- Long Running Tasks - Durable Agents

1 reply

replied to dhruv3006's post 19 days ago

no free license?

replied to omarkamali's post 22 days ago

Works great.

I am using nvtop, which gives me a list of the processes in VRAM. I mostly use it to kill some processes so that I can start a new one.

It would be good to implement such a feature to see what is going on per process.
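A per-process view like the one requested could be sketched as follows. This is a hedged illustration, not how nvtop or the tool under discussion works: it assumes an NVIDIA GPU with `nvidia-smi` on the PATH and uses its `--query-compute-apps` CSV output. The function names are hypothetical, and the sample text stands in for real output so the parser can be tried without a GPU.

```python
import subprocess

# nvidia-smi query that lists compute processes with their VRAM use in MiB.
QUERY_COMMAND = [
    "nvidia-smi",
    "--query-compute-apps=pid,process_name,used_memory",
    "--format=csv,noheader,nounits",
]


def parse_compute_apps(csv_text):
    """Turn 'pid, name, MiB' lines into (pid, name, mib) tuples, largest first."""
    rows = []
    for line in csv_text.splitlines():
        parts = [part.strip() for part in line.split(",")]
        if len(parts) < 3:
            continue  # skip blank or malformed lines
        pid = int(parts[0])
        name = ",".join(parts[1:-1])  # tolerate commas inside the process name
        try:
            used_mib = int(parts[-1])
        except ValueError:
            used_mib = None  # some platforms report "[N/A]"
        rows.append((pid, name, used_mib))
    rows.sort(key=lambda row: row[2] or 0, reverse=True)
    return rows


def list_gpu_processes():
    """Run nvidia-smi and parse its output (requires an NVIDIA driver)."""
    result = subprocess.run(QUERY_COMMAND, capture_output=True, text=True, check=True)
    return parse_compute_apps(result.stdout)


# Made-up sample output, used here instead of a live nvidia-smi call.
sample_output = "1337, /usr/bin/llama-server, 6144\n4242, python3, 10240\n"
for pid, name, used_mib in parse_compute_apps(sample_output):
    print(f"{pid:>7}  {used_mib} MiB  {name}")
```

Sorting by memory use puts the largest consumers first, which matches the use case described above: finding which process to stop before loading another model.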

replied to aathithya1411's post 22 days ago

To create an image-generating model from the beginning, you first need to show a computer a massive library of pictures, each with a description of what it shows. The AI learns through a process of destruction and recreation; it takes a clear image, adds random noise until it's just static, and then is trained to reverse that process, using the text description as its guide to remove the noise step-by-step and rebuild the original image. By repeating this with millions of different images and descriptions, the model learns the connection between words and visual information, eventually gaining the ability to generate a completely new, coherent picture simply from a new text prompt it has never seen before.
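The "add noise, then learn to reverse it" idea above can be sketched in a few lines. This is a toy illustration under stated assumptions: it uses a DDPM-style linear noise schedule (the reply does not name a specific method), treats an image as a flat list of pixel values, and omits the neural network and the text conditioning entirely. All function names and schedule values are my own choices.

```python
import math
import random


def linear_alpha_bars(num_steps, beta_start=1e-4, beta_end=0.02):
    """Fraction of the original signal variance that survives up to each step."""
    alpha_bars = []
    surviving = 1.0
    for step in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * step / (num_steps - 1)
        surviving *= 1.0 - beta
        alpha_bars.append(surviving)
    return alpha_bars


def add_noise(pixels, alpha_bar, rng):
    """Forward process: blend the clean image with Gaussian noise.

    Returns the noisy image and the noise itself, which is what the model
    would be trained to predict.
    """
    noise = [rng.gauss(0.0, 1.0) for _ in pixels]
    signal_scale = math.sqrt(alpha_bar)
    noise_scale = math.sqrt(1.0 - alpha_bar)
    noisy = [signal_scale * p + noise_scale * n for p, n in zip(pixels, noise)]
    return noisy, noise


def denoising_loss(predicted_noise, true_noise):
    """Mean squared error between predicted and actual noise."""
    squared = [(p - t) ** 2 for p, t in zip(predicted_noise, true_noise)]
    return sum(squared) / len(squared)


rng = random.Random(0)
image = [0.8, -0.2, 0.5, 0.1]  # stand-in for real pixel data
alpha_bars = linear_alpha_bars(1000)

slightly_noisy, _ = add_noise(image, alpha_bars[0], rng)    # almost the original
almost_static, noise = add_noise(image, alpha_bars[-1], rng)  # almost pure noise

# A real model would predict `noise` from `almost_static`, the step index,
# and the text description; a perfect prediction gives zero loss.
print(denoising_loss(noise, noise))
```

At the first step nearly all of the image survives, while at the last step almost none does, which is the "until it's just static" part of the description. Training repeats this for random steps across many captioned images.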

replied to csabakecskemeti's post 22 days ago

Did I understand correctly that it runs on separate machines?

replied to legolasyiu's post 23 days ago

There is no description saying specifically what is new in your release.

replied to telcom's post 23 days ago

Individual users win only if running LLMs on their own hardware becomes cheaper, faster, and freer, as in software freedom. Otherwise, those mega-stories are of no use.

replied to Jiaqi-hkust's post 24 days ago

Is there a GGUF version?

reacted to inoculatemedia's post with 👍 24 days ago

Post

1472

I’m opening the waitlist for what I believe to be the most advanced multimodal bridge for A/V professionals. Txt2img, img2video, editing, export to ProRes, apply LUTs, Pexels and TouchDesigner integrations, music and voice gen, multichannel mixing.

Announcing: Lilikoi by Haawke AI

Teaser video made entirely with Lilikoi:
https://youtu.be/-O7DH7vFkYg?si=q2t5t6WjQCk2Cp0w

Https://Lilikoi.haawke.com

Technical brief:
https://haawke.com/technical_brief.html
