464 points by modinfo 6 days ago | 204 comments | View on ycombinator
lumost 6 days ago |
flashman 5 days ago |
mk_stjames 6 days ago |
and so now I'm wondering how cool /fast / compressed a diffusion image generator could be if the images it was trained on / space it worked in was limited to 1 bit (Floyd-Steinberg / Atkinson / your favorite algo here) dithered images.
Training would surely be pretty quick and probably fit onto one modern GPU.
mft_ 6 days ago |
IME, the bottleneck when using diffusion models isn't storage space or memory, it's generation time. Lots of models will run on 8-12 GB 1080-generation GPUs onwards, or on Macs with similar memory, which are probably the bottom end from a GPU power perspective anyway. I also note that these models are marginally slower than the small FLUX.2 model they're based on.
Okay, maybe this allows running a local model on something that has a reasonably powerful GPU and limited memory, like an iPhone, but is that really a common requirement?
liuliu 6 days ago |
This is wrong. But they worded it carefully to be not entirely wrong.
FLUX.2 [klein] 4B (the same parameter class, basically the same model) runs on iPhone through Draw Things app, with 8-bit or 6-bit quantization (hence not "directly", I guess, but that is the technicality that sounds fishy enough).
hmokiguess 5 days ago |
sorenjan 6 days ago |
ttul 6 days ago |
kordlessagain 5 days ago |
smallerize 6 days ago |
Isn't SD XL 3.5B? And the refiner model is even larger. Those can run on an iPhone 13 Pro.
jeroenhd 6 days ago |
I do wonder how these compare to existing image generation models. I've tried https://github.com/alichherawalla/off-grid-mobile-ai for a while but I find the image generation models rather lacking.
MitPitt 6 days ago |
a1o 6 days ago |
willXare 4 days ago |
wiradikusuma 6 days ago |
sroussey 5 days ago |
cadamsdotcom 6 days ago |
Sadly right now the expensive developer subscription means the few folks willing to hold a forever subscription make something that barely works then move on… or make something with so many ads it is an app. For example Google’s “Model Garden” app has no ads but still has major UX issues and isn’t suitable for daily use, even though the models are amazing.
Raising awareness of how capable today’s phone hardware is will make normal people demand to run what they choose on their phones. It’d be a much stronger way back to general purpose computing than via all legislation that has been tried so far..
moralestapia 6 days ago |
potatoman22 6 days ago |
vorticalbox 5 days ago |
https://huggingface.co/spaces/webml-community/bonsai-image-w...
junto 6 days ago |
Led me to wonder what happens if a domain gets a new owner, and they want to petition Apple to remove the block.
willXare 4 days ago |
captainregex 6 days ago |
kordlessagain 5 days ago |
undefined 5 days ago |
willXare 4 days ago |
SilentM68 6 days ago |
Is it compatible with Ollama, ComfyUI or are those providers unneeded, compatible with low-end hardware?
Also, where does "./setup.sh/ drop the components in Linux?
Thank you, Sol
jijji 6 days ago |
n3xyf 5 days ago |
undefined 4 days ago |
dbcooper 6 days ago |
sudb 6 days ago |
edf13 5 days ago |
Website Not Allowed “prismml.com” is a restricted website.
iJohnDoe 6 days ago |
janniks 6 days ago |
lwansbrough 5 days ago |
I can think of a lot of positives. The negatives amount to a convoluted argument about the limits of free speech.
yieldcrv 6 days ago |
having trouble loading the webgl browser demo on my phone but no biggy
woadwarrior01 6 days ago |
danielEM 6 days ago |
baisampayans 5 days ago |
Songjinhao 5 days ago |
maephisto666 6 days ago |
huflungdung 6 days ago |
There are many problems I want to work on which require billions of tokens. These are completely inaccessible without corporate project sponsorship at the moment. An asic generation machine which can pump out a few 10s of thousands of tokens per second at opus4.6 quality is more than sufficient.