Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I think it does way more than gpt-image-1 too?

Besides style transfer, object additions and removals, text editing, manipulation of human poses, it also supports object detection, semantic segmentation, depth/edge estimation, super-resolution and novel view synthesis (NVS) i.e. synthesizing new perspectives from a base image. It’s quite a smorgasbord!

Early results indicate to me that gpt-image-1 has a bit better sharpness and clarity but I’m honestly not sure if OpenAI doesn’t simply do some basic unsharp mask or something as a post-processing step? I’ve always felt suspicious about that, because the sharpness seems oddly uniform even in out-of-focus areas? And sometimes a bit much, even.

Otherwise, yeah this one looks about as good.

Which is impressive! I thought OpenAI had a lead here from their unique image generation solution that’d last them this year at least.

Oh, and Flux Krea has lasted four days since announcement! In case this one is truly similar in quality to gpt-image-1.



Not to mention, flux models are for non-commercial use only.


the license for flux models is $1,000/mo, hardly an obstacle to any serious commercial usage


Per 100k image. And it is additionally $0.01 per image. Considering H100 is $1.5 per hour and you can get 1 image per 5s, we are talking about bare-metal cost of ~$0.002 per image + $0.01 license cost.


The pricing seems reasonable for a SOTA class model that needs to be commercially viable or it dies.


Yeah, not trying to argue either way other than saying contrary to the parent comment, licensing cost is a significant component of the cost equation.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: