Powergen-AI (PowergenAI)

posted an update 4 days ago

Post

2086

FireRed-Image-Edit-1.0 (Rapid) Fast Experimental Demo is Out! 🚀🤗

Demo: prithivMLmods/FireRed-Image-Edit-1.0-Fast

-> Paired the EditPlusPipeline with the Diffusers-compatible transformer weights of Rapid AIO from Qwen-Image-Edit. (experimental)
-> This fusion delivers more accurate instruction following, higher image quality, and consistent visual coherence @ 4-step fast inference.
-> Better maintains text styles with high fidelity, along with high-quality old photo restoration, enhancement, and best-in-class virtual try-on.

prithivMLmods

posted an update 8 days ago

Post

3155

Try the demo for Qwen3-VL-abliterated-MAX-Fast: prithivMLmods/Qwen3-VL-abliterated-MAX-Fast (Unredacted: Ask Anything with Near-Zero Refusal Rates). The full model series collection is available here: https://huggingface.co/collections/prithivMLmods/unredacted-max-vl

prithivMLmods

posted an update 12 days ago

Post

2551

Dropping the Qwen3 VL Series of Unredacted MAX-VL models. These models have undergone multi-stage training to minimize refusal rates through continuous abliterated optimization. You can find the models in BF16, FP8-Dynamic, and GGUF formats at the links below.🔥🚀

Unredacted MAX - VL:
➜ prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX
➜ prithivMLmods/Qwen3-VL-4B-Thinking-Unredacted-MAX
➜ prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX
➜ prithivMLmods/Qwen3-VL-8B-Thinking-Unredacted-MAX

Unredacted MAX - VL [FP8]
➜ prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX-FP8
➜ prithivMLmods/Qwen3-VL-4B-Thinking-Unredacted-MAX-FP8
➜ prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX-FP8
➜ prithivMLmods/Qwen3-VL-8B-Thinking-Unredacted-MAX-FP8

Unredacted MAX - VL [GGUF]
➜ prithivMLmods/Qwen3-VL-4B-Instruct-Unredacted-MAX-GGUF
➜ prithivMLmods/Qwen3-VL-4B-Thinking-Unredacted-MAX-GGUF
➜ prithivMLmods/Qwen3-VL-8B-Instruct-Unredacted-MAX-GGUF
➜ prithivMLmods/Qwen3-VL-8B-Thinking-Unredacted-MAX-GGUF

Unredacted MAX - VL [Collection]
➜ https://huggingface.co/collections/prithivMLmods/unredacted-max-vl-fp8
➜ https://huggingface.co/collections/prithivMLmods/unredacted-max-vl
➜ https://huggingface.co/collections/prithivMLmods/unredacted-max-vl-gguf

To learn more, visit the app page or the respective model pages.

prithivMLmods

posted an update 20 days ago

Post

2922

Introducing FLUX.2-Klein-LoRA-Studio, a demo for image editing using specialized LoRA adapters built for the FLUX.2-Klein-Distilled model. It features an edit-style gallery for multi-style image editing, including de-light, face swap, mannequin, and more. Try the demo below.

🤗Demo: prithivMLmods/FLUX.2-Klein-LoRA-Studio
🤗Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection
🤗GitHub: https://github.com/PRITHIVSAKTHIUR/FLUX.2-Klein-LoRA-Studio

To learn more, visit the app page or the respective model pages.

prithivMLmods

posted an update 24 days ago

Post

860

GLM OCR, a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture. It delivers high accuracy and strong generalization with a blazing-fast inference pipeline. The demo is live . Try it now. 🤗🚀

✨ Demo: prithivMLmods/GLM-OCR-Demo
✨ Multimodal Implementations: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
✨ GitHub: https://github.com/PRITHIVSAKTHIUR/GLM-OCR-Demo

prithivMLmods

posted an update 25 days ago

Post

2163

Introducing the Qwen-Image-Edit-3D-Lighting-Control app, featuring 8× horizontal and 3× elevational lighting positions for precise 3D lighting control. It enables studio-level lighting using fast Qwen Image Edit fast inference, paired with Multi-Angle-Lighting adapters. 🔦

🔥 Space: prithivMLmods/Qwen-Image-Edit-3D-Lighting-Control
✅ Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection
📂 GitHub: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-3D-Lighting-Control

prithivMLmods

posted an update about 1 month ago

Post

3645

Daggr UI version of the Qwen3-TTS demo.🔥
(custom voice, voice design, qwen3-asr and voice cloning) nodes.
No remote spaces used for API inference; all functions run in-app fn.
Powered by t4-m and built with daggr@0.5.2 and gradio@6.

👉Demo: prithivMLmods/Qwen3-TTS-Daggr-UI
⭐Github: https://github.com/PRITHIVSAKTHIUR/Qwen3-TTS-Daggr-UI

1 reply

·

prithivMLmods

posted an update about 1 month ago

Post

2702

Qwen-Image-Edit-Object-Manipulator Space is now featured in Hugging Face Space of the Week. It enables object manipulation such as extracting objects, adding designs, and removing objects or designs from the red highlighted area using specialized adapters.

🔥Do enjoy the demo! ~ prithivMLmods/Qwen-Image-Edit-Object-Manipulator

Collections:
🧨Adapters-1: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-exps
🧨Adapters-2: https://huggingface.co/collections/prithivMLmods/qie-jan-23-26
🧨Adapters-3: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-object-manipulator

⭐Github: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-Object-Manipulator

To learn more, visit the app page or the respective model pages.

1 reply

·

prithivMLmods

posted an update about 1 month ago

Post

3050

Introducing QIE-2511-Zoom-Master for highlight-guided area zoom-in, enabling lossless zooming within a drawn square area, and QIE-2511-Object-Remover-v2 for precise object or highlight-guided area cleanup. These experimental adapters are trained based on QIE-2511. Find the adapters below.

🕹️QIE-2511-Zoom-Master : prithivMLmods/QIE-2511-Zoom-Master
🕹️QIE-2511-Object-Remover-v2: prithivMLmods/QIE-2511-Object-Remover-v2

🤗Demo: prithivMLmods/Qwen-Image-Edit-Object-Manipulator

📂Collection: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-exps

To learn more, visit the app page or the respective model pages.

2 replies

·

telcom

posted an update about 1 month ago

Post

161

A simple resume Q/A app telcom/ResumeQA

I watched this
"Stop Competing With 400 Applicants. Build This in One Weekend (Yes, there's a no code option too! " https://youtu.be/0teZqotpqT8?si=L2Vf0xrZ6_t7K7-D
and later also saw a post https://huggingface.co/posts/ZennyKenny/848353801795401

Inspired me to do a simple space. Feel free to use it.

3 replies

·

telcom

posted an update about 1 month ago

Post

1581

MAD-GRPO: https://huggingface.co/blog/telcom/mad-grpo
In R1-Zero-Like Training *, Dr.GRPO treats GRPO’s by dropping std, but that often comes with a hidden side effect: length-weighted updates that can nudge model toward verbosity.
MAD-GRPO provides robust scale (MAD + epsilon) per-token normalization stability without verbosity bias.

*https://huggingface.co/papers/2503.20783

prithivMLmods

posted an update about 2 months ago

Post

5581

LTX-2 Camera-Control LoRA demo with dolly-in/out and dolly-left/right is now available on Hugging Face, paired with ltx-2-19b-distilled-lora for fast inference. It also includes dynamic GPU duration adjustments for long video generations. Click the related Space links below.

🤗Try it now on : prithivMLmods/LTX-2-LoRAs-Camera-Control-Dolly
⭐Github: https://github.com/PRITHIVSAKTHIUR/LTX-2-LoRAs-Camera-Control-Dolly
🕹️Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

To learn more, visit the app page or the respective model pages.

2 replies

·

telcom

posted an update about 2 months ago

Post

214

CAQBI Index
is like a credit score for how an AI understands a concept.
Please check
https://huggingface.co/blog/telcom/caqbi

1 reply

·

prithivMLmods

posted an update about 2 months ago

Post

2480

Dropping Image Edit (Object Manipulator): Add or remove specified objects/designs, with flexible support for both single-image and multi-image modes.

🤗 Demo: prithivMLmods/Qwen-Image-Edit-Object-Manipulator

Qwen-Image-Edit-2511-Object-Remover is an adapter (LoRA) developed for Qwen’s Qwen-Image-Edit-2511 image-to-image model. It is specifically designed for precise object removal from images.

⭐ Model: prithivMLmods/Qwen-Image-Edit-2511-Object-Remover

Qwen-Image-Edit-2511-Object-Adder is an adapter (LoRA) developed for Qwen’s Qwen-Image-Edit-2511 image-to-image model. It is specifically designed for precise object addition to images.

⭐ Model: prithivMLmods/Qwen-Image-Edit-2511-Object-Adder

🕹️ Collection: https://huggingface.co/collections/prithivMLmods/qwen-image-edit-object-manipulator
🕹️ github: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-Object-Manipulator

To learn more, visit the app page or the respective model pages.

telcom

posted an update about 2 months ago

Post

235

if you are interested in HUB (https://saemi410.github.io/HUB/ I recommend the fork I have created with some updates to make it smooth in running a smoke test git@github.com:javadtaghia/HUB.git) and you want to run the UCE (https://unified.baulab.info), please check:
- Model weights for UCE here: telcom/uce_NSFW
- Model weights for ESD here: telcom/esd_NSFW
- datasets and more download materials from: telcom/HUB_reference_dataset

Please read the notes in the model card.

prithivMLmods

posted an update 2 months ago

Post

4229

Update: TRELLIS.2 (Text to 3D, Image to 3D) Gradio with Rerun Embedded demo with improved visualization of the 3D model previewer is now available on Hugging Face. Generate assets and view them in the 3D viewer, powered and streamlined with Microsoft’s TRELLIS.2 and Tongyi-MAI’s Z-Image-Turbo models.

🤗 TRELLIS.2 (Demo): prithivMLmods/TRELLIS.2-Text-to-3D
🕹️ GitHub: https://github.com/PRITHIVSAKTHIUR/TRELLIS.2-Text-to-3D-RERUN
🕹️ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations

To know more about it, visit the app page or the respective model page!

prithivMLmods

posted an update 2 months ago

Post

4281

Introducing the Qwen-Image-Edit-2511-LoRAs-Fast demo, featuring image property comparison and contrast, built on top of Gradio and the combined Rerun SDK. It supports single and multi-image edits with existing LoRAs that are lazily loaded. (Note: This is still an experimental Space for Qwen-Image-Edit-2511.)

⭐ Space Demo: prithivMLmods/Qwen-Image-Edit-2511-LoRAs-Fast
⭐ GitHub: https://github.com/PRITHIVSAKTHIUR/Qwen-Image-Edit-2511-LoRAs-Fast-Multi-Image-Rerun
⭐ Collection: https://huggingface.co/collections/prithivMLmods/image-generation-apps-collection

To know more about it, visit the app page or the respective model page!

2 replies

·

telcom

posted an update 2 months ago

Post

268

NVIDIA’s Groq deal ... I think, inference efficiency is becoming the main driver of profitability, and NVIDIA’s Groq deal is evidence the market is moving from “who can train biggest” to “who can serve cheapest and fastest at scale.” That points to a maturing phase of AI, not necessarily the end of a bubble, but definitely a correction in what “wins” long-term.
What do you think?

2 replies

·

telcom

posted an update 2 months ago

Post

186

CIFAR-10 your handing image dataset ...
CIFAR-10 is a small, standard computer-vision dataset used to quickly test and compare ideas.

- 60,000 color images, each 32×32 pixels, labeled into 10 classes: airplane, automobile, bird, cat, deer, dog, frog, horse, ship, truck.
- Label mapping (important):

- 0 airplane
- 1 automobile
- 2 bird
- 3 cat
- 4 deer
- 5 dog
- 6 frog
- 7 horse
- 8 ship
- 9 truck
- Split: 50,000 train and 10,000 test.
- Why people use it: fast benchmarking for image classifiers (small CNNs, ResNet, ViT), and quick experiments for training pipelines, augmentation, regularization, pruning, distillation, and demos.
- Sizes (downloads): Python version about 163 MB, binary about 162 MB. Hugging Face shows about 144 MB for the dataset files.
- Where to get it: the official CIFAR page (University of Toronto) and the Hugging Face CIFAR-10 dataset page.
uoft-cs/cifar10
If you want something more, check the table below
| Dataset | Resolution | Classes | Best For |
| ImageNet 1K | 224–256×256 | 1000 | Real-world large-scale classification |
| ImageNet-256. | 256×256 | 1000 | Direct high-res training |
| TinyImageNet | 64×64 | 200 | Mid-range benchmark |
| UC Merced Land Use | 256×256 | ~21 | Higher resolution small classification |
| MS COCO | >256×256 | ~80 objects | Detection / segmentation |

telcom

posted an update 2 months ago

Post

2054

arXiv CS endorsement

It's Javad, my Google Scholar Profile:
https://scholar.google.com/citations?user=bja6GwoAAAAJ&hl=en
I would like to share my articles with you on Hugging Face, I'm asking for endorsement* in Computer Science arxiv.org.

If you would like to endorse me, please visit the following URL:
https://arxiv.org/auth/endorse?x=NVUAPL
If that URL does not work for you, please visit
http://arxiv.org/auth/endorse.php
and enter the following six-digit alphanumeric string:
Endorsement Code: NVUAPL

Thanks you in advance.
Javad Taghia

* Who is qualified to endorse?

To endorse another user to submit to the cs.AI (Artificial Intelligence) subject class, an arXiv submitter must have submitted 3 papers to any of cs.AI, cs.AR, cs.CC, cs.CE, cs.CG, cs.CL, cs.CR, cs.CV, cs.CY, cs.DB, cs.DC, cs.DL, cs.DM, cs.DS, cs.ET, cs.FL, cs.GL, cs.GR, cs.GT, cs.HC, cs.IR, cs.IT, cs.LG, cs.LO, cs.MA, cs.MM, cs.MS, cs.NA, cs.NE, cs.NI, cs.OH, cs.OS, cs.PF, cs.PL, cs.RO, cs.SC, cs.SD, cs.SE, cs.SI or cs.SY earlier than three months ago and less than five years ago.

AI & ML interests

Team members 40

Powergen-AI's activity