Product Hunt Weekly Digest
February 11th, 2024

PRODUCT HIGHLIGHT
Apple released a new model for image editing

Apple teamed up with researchers at the University of California, Santa Barbara (Go gauchos!) to create a new AI model called MGIE (MLLM-Guided Image Editing).

The model can edit images — think of the stuff you do in Photoshop, but with plain English prompts. It can crop, resize, rotate, brighten, sharpen, remove objects, and so on, plus edit specific parts of your image, like changing the shape, size, and texture of items.

Perhaps the most interesting or “revolutionary” thing about the model is that it derives expressive instruction from your prompt.

Imagine using a phrase to ask AI to edit your photo for you. It might look something like this: “Make the sky more blue.” But MGIE can produce instructions like this: “Increase the saturation of the sky region by 20%,” explains Michael Nuñez for VentureBeat.

The model also works by creating a representation of the edit that the end user wouldn’t see, which it uses as a guide for how to manipulate pixels — “an end-to-end training scheme that jointly optimizes the instruction derivation, visual imagination, and image editing modules.”

That’s a lot for me to process too, but the key point is that the model is being recognized as a breakthrough for using multimodal large language models (MLLMs) for interpreting user input and making edits at the pixel-level. People are also loving that it's open-source. The project is on GitHub and a demo is on Hugging Face.

Edit photos with words
Sponsored By
Newsletter Sp-onsor

Tired of explaining the same thing over and over again to your colleagues? It’s time to delegate that work to AI. guidde is a GPT-powered tool that helps you explain the most complex tasks in seconds with AI generated documentation.

  • Turn boring documentation into stunning visual guides
  • Save valuable time by creating video documentation 11x faster
  • Share or embed your guide anywhere for your team to see

    Simply click capture on our browser extension and the app will automatically generate step-by-step video guides complete with visuals, voiceover and call to actions.

    The best part? Our extension is 100% free.

  • TALES FROM PLANET INTERNET

    🤗 AI for Good: Sensay is a platform that uses AI to help dementia patients and their families preserve cherished memories, likenesses, stories, and personalities while also assisting patients by digitally replicating people closest to them.

    🔨 Midjourney phone: Okay, maybe not a phone, but it does look like Midjourney could be cooking up something in the hardware department. The company recently hired an engineer from the Apple Vision Pro to be its “Head of Hardware.”