@nathaniel The new Gemini 2.5 Flash Image model (aka Nano Banana) is crazy good specially in producing consistent images. When can we expect it to arrive here in Pickaxe?
Been playing around with it and here’s the results. Great for eCommerce Product/Model Photography use case.
I think though this is a Pickaxe side thing maybe?
I was previously using OpenAI image generation and a very similar thing was happening where users were uploading reference images and it was generating images but not using the reference images at all.
Essentially I don’t think Pickaxe is passing the user uploaded photo to the API (I think - I’m working with ChatGPT over here who’s coaching me through setting this up, so it could be hallucinating)
When using Pickaxe’s Fal action to call Nano Banana, Image Editing (/edit), requests fail with:
TypeError: fal() got an unexpected keyword argument 'image_urls'
Steps to reproduce
In Pickaxe, set Submit URL to https://queue.fal.run/fal-ai/nano-banana/edit and Status URL to https://queue.fal.run/fal-ai/nano-banana/requests/$REQUEST_ID/status.
Trigger the action with:
A reference product image attached (public URL),
A prompt.
Pickaxe sends the request via its Fal action.
Expected behavior
Pickaxe should POST a raw JSON body with top-level fields:
@stephenbdiaz so after some back and forth, I managed to get it to work. The Fal.ai action is working well. I think the issue is A) The model I recommended earlier and B) The system prompt might need refinement.
(Note: If you give it a simple prompt, it will make you look even more handsome )
Here’s the other model I recommend:
And here’s a working system prompt that’ll get you results from the first try:
<ROLE>
You are 'Nano Bananas,' a fun and slightly wild creative AI. Your job is to take a user's image and instruction and turn it into something awesome.
</ROLE>
<TASK>
Your primary task is to process a user's image and text input via your configured "Fal AI" action and present the final result to the user.
</TASK>
<WORKFLOW>
1. Your "Fal AI" action is configured to run automatically on any user input that includes an image.
2. After the action completes, your final response **MUST** consist of two parts, delivered in the same turn: the image result itself, followed by a short, quirky, in-character confirmation phrase.
</WORKFLOW>
<CONSTRAINTS>
- Your confirmation phrase must be very short and energetic (e.g., "Voila!", "Check this out!", "Boom! Fresh from the art-machine!", "Here you go!").
- You **MUST NOT** describe the image you created.
- You **MUST NOT** ask for permission to show the image.
- You **MUST NOT** ask any questions.
- If the user provides input that does not include an image, your ONLY permitted response is the following text: "Please upload an image to begin."
</CONSTRAINTS>```
any luck with this @stephenbdiaz I am trying to get an image agent to work properly and am having a difficult time as well. Do only some of the models allow for image upload?