Online

Elman Mansimov

alignDRAW

Dec 11 - 13, 2023

Featured Exhibition Image

Overview

This is the Nicéphore Niépce of AI... This is amazing.

- Darius Himes, International Head of Photographs at Christie's

By early 2015, neural networks had mastered the art of 'image-to-text' and could create natural language captions for images. Flipping this process, and turning text into image, was a much more complex challenge. 19-year old prodigy Elman Mansimov's alignDRAW model solved this problem, a breakthrough that marked a new era of human-machine collaboration.
In his seminal paper, ‘Generating Images from Captions with Attention‘, Mansimov led a team of researchers at Toronto University to create a model that didn’t simply pull existing images from databases, like a search engine would, but rather to generate completely unique scenarios that had no parallel in reality. They tested it by requesting it to produce visuals of unheard scenarios, such as “a toilet seat sits open in the grass field” or “a green school bus parked in a parking lot”.
The small 32 by 32 pixel outputs are significant as a proof of concept for a new form of communication between machines and humans, one that involves our own language instead of code. The alignDRAW model marked the day when we could finally exchange ideas with machines as equals, with these images portraying the infancy of a technology that would quickly mature and come to define artistic practice in the early 2020s.

“When we look at the initial low-resolution and hazy images from nearly two centuries ago, we see the whole future potential of photography, which eventually became the dominating imaging and communication technology of our time. And when I look at comparable low-resolution alignDRAW pictures, I see a similar promise for a new major visual method that could very soon become as essential as lens photography was in the last two hundred years.”
– Dr. Lev Manovich

"Modern machine learning approaches to text to image synthesis started with the work of Mansimov et al. (2015)"
– Ilya Sutskever, Mark Chen, Alec Radford et al. in their DALL-E paper ‘Zero-Shot Text-to-Image Generation’
Read More

Artworks:

Featured

All