Hacker Newsnew | past | comments | ask | show | jobs | submit | bryced's commentslogin

Asked this and related questions to o3. I do not vouch for the answers at all but you may find it interesting. https://chatgpt.com/share/6876cdd1-dfbc-8011-a55f-6915a90275...


> would it be possible to use this tool to make automatic mask for editing in something like GIMP

probably but GIMP plugins are not something I've looked into

> REPL

already done. just type `aimg` and you're good to go

> GUI

GUIs add a lot of complexity. Can your file manager do thumbnails and quick previews?


> GUIs add a lot of complexity. Can your file manager do thumbnails and quick previews?

Somewhat OT, but I find this really funny. It says a lot about the difficulty of using various ecosystems and where communities spend time polishing things.

"Yeah, I made something that takes natural language and can do things like change seasons in an image. But a GUI? That's complicated!"

It's not a criticism of you, but the different ecosystems and what programmers like to focus on nowadays.


Fair but I'd point out I also didn't make the algorithm that changes photos. I'm wrapping a bunch of algorithms that other people made in a way that makes them easy to use.

It's not just that GUI's are hard, it's that the "customer" base will inevitably be much less technical and I'd receive a lot more difficult to resolve bug reports. So no-gui is also a way of staying focused on more interesting parts of the project.


thanks for the quick answer and cool for REPL. Yeah sure I can just launch Gwenview on the output directory.

> probably but GIMP plugins are not something I've looked into

I was just thinking about a black and white or grey level output image with the desired area , no need to integrate it in GIMP of whatever. I've tried a prompt like "keep only the face", but no luck so far.


There is a smart mask feature. Add `--mask-prompt face --mask-mode keep`. I believe it outputs the masks as well


I'm told Imaginairy secretly does run on Intel Mac. very slowly. I just don't want to be on the hook for support so the reqs are written that way.


Ooh, I'll give it a whirl - performance isn't a priority right now. Thanks Bryce.


This is itself it's own finetuned version of SD so now it won't work with alternative versions. img2img works by just running normal stable diffusion img2img on a noised starting image. As such it destroys information at all parts of the image equally. This new model uses attention mechanisms to decide which parts of the image are important to modify. It can leave parts of the image untouched while making drastic changes to other parts.


Well, to be fair you can use feathered bitmap masks for img2img with some UIs (automatic1111)


Perfect explanation, thank you!


The console output tells you what happened.


Here is a google colab you can try it in: https://colab.research.google.com/drive/1rOvQNs0Cmn_yU1bKWjC...


Here is a colab you can try it in. It crashed for me the first time but worked the second time. https://colab.research.google.com/drive/1rOvQNs0Cmn_yU1bKWjC...


I could not get the first cell to run.

ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. tensorflow 2.9.2 requires protobuf<3.20,>=3.9.2, but you have protobuf 3.20.3 which is incompatible. tensorboard 2.9.1 requires protobuf<3.20,>=3.9.2, but you have protobuf 3.20.3 which is incompatible.


I believe you can just ignore that error. The cell ran.


EDIT - it's free of charge: https://research.google.com/colaboratory/faq.html

---

First time I've used "colab" - looks great. Out of interest, who pays for the compute used by this?

Is it freely offerred by Google? Or is it charged to my Google API account when I use it? Or your account? It wasn't clear in the UI.


Freely offered by Google. They offer a subscription model if you want to run your collab notebooks on a beefier machine.


Huh, I'm trying it now and the results seem so weak compared to any other model I've seen since dall-e.


Hmm that’s true for me too. Not sure if it is due to resource constraint. I had a picture of a car in indoor parking lot with walls and pillars. When I prompted “Color the car blue” the whole image was drenched in a tint of blue. Similarly when prompted “make a hummingbird hover” … the hummingbirds were a patch of shiny colors with an shape that sort of looked like an hummingbird but not like a real one.


Try "turn the car blue"


Does dalle do prompt based photo edits now?

But yeah sometimes it doesn't follow directions well. I haven't noticed a pattern yet for why that is.


How would I upgrade to 2.1 if running locally?


If you're wanting to use Stable Diffusion 2.1 with imaginairy you just specify the model with `--model SD-2.1` when running the `aimg imagine` command.


Sorry for the offtopicness but could you please email me at hn@ycombinator.com? (I have an idea/possibility for you. Nothing that couldn't be posted here in principle but I'd rather not take the thread off topic.)


This was an uncanny comment, somehow.

Hope ya'll brainstorming session is fruitful.


Sorry for the uncanning! My thought was simply to connect the OP with a YC partner in case they wanted to explore doing this as a startup.

I send such emails all the time but on semi-rare occasions have to resort to offtopic pleas like the GP.

I hope that helps clear things up!


It does, thank you!


It does work in non-pro colab apparently. Here you go: https://colab.research.google.com/drive/1rOvQNs0Cmn_yU1bKWjC...


If you're used to installing python packages it should be relatively easy. There are other projects with nice UIs but that's not what this library is for.


I tried it out :-)

`aimg edit assets/girl_with_a_pearl_earring.jpg "make it pop" --prompt-strength 40 --gif`

https://user-images.githubusercontent.com/1217531/213912442-...


I think it tried to make it pop art? Which is not a bad response to be fair.


I was expecting something balloon-like to appear, and was not disappointed.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: