Project

The YouTube generation is creating its own music videos whether rights owners like it or not. Why not offer them tools to channel their creativity and make money while doing it? In the AudioImager project, the Helsinki Institute for Information Technology (HIIT) is examining how Open Content could generate value for the content business. The project is funded by the National Technology Agency Tekes and industry partners. The mindset of the creative industry has changed in the past couple of years: rights owners are now willing to try new licensing models and to recognise consumers’ role as co-creators. Part of the testing is done together with the Finnish composers’ collecting society Teosto, the online music store platform provider SecuryCast and the record company EMI Music.

It is the night before your big presentation, and you are thinking that those bullet points could use some graphics. We have all been there, scrambling through Google image search to find images for our PowerPoint presentations. PowerPoint might be the most widely used tool for committing copyright infringement. HIIT is looking to change that by making millions of open content images easily available right where they are used.

While many of the images found online are protected by copyright and can be used by the rights owner only, there is a growing pool of images that are either in the public domain or licensed under permissive open content licenses. There are nearly 200 million Creative Commons licensed photos on the online photo management and sharing application Flickr. Wikimedia Commons has another collection of 25 million images. However, finding the right images and using them legally in media products can be difficult. Unlike commercial stock photo services, open content repositories are rarely integrated into creative software platforms. Finding a suitable image among millions of images from several sources can be slow. Image users also have to work out how to handle the required attribution of the original author.

Many of the websites that store open images, like Flickr, offer open Application Programming Interfaces (APIs) that let third-party programs and services access their collections. However, even computer programs have a tough time accessing the different repositories. Millions of open content images are scattered across several repositories behind slow APIs, the search results come back in multiple data formats, and their quality varies. The biggest problem, however, is that searches do not learn and evolve. There is no reliable way to correct irrelevant tags, to write new tags or to change the order of search results.
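To make the "multiple forms" problem concrete, here is a minimal sketch of mapping results from two repositories onto one common record. The two raw shapes, the field names and the function names are all illustrative assumptions; they are not the actual response formats of Flickr, Wikimedia Commons or any other service.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ImageRecord:
    """One common shape for an image, whatever repository it came from."""
    source: str
    image_id: str
    title: str
    author: str
    license: str
    tags: tuple


def from_repo_a(raw: dict) -> ImageRecord:
    # Hypothetical repository A: tags arrive as one space-separated string.
    return ImageRecord(
        source="repo_a",
        image_id=str(raw["id"]),
        title=raw.get("title", "").strip(),
        author=raw.get("ownername", "unknown"),
        license=raw.get("license", "unknown"),
        tags=tuple(sorted({t.lower() for t in raw.get("tags", "").split()})),
    )


def from_repo_b(raw: dict) -> ImageRecord:
    # Hypothetical repository B: nested metadata, tags as a list of categories.
    meta = raw.get("metadata", {})
    return ImageRecord(
        source="repo_b",
        image_id=str(raw["pageid"]),
        title=raw.get("name", "").strip(),
        author=meta.get("artist", "unknown"),
        license=meta.get("license_short", "unknown"),
        tags=tuple(sorted({c.lower() for c in raw.get("categories", [])})),
    )


def normalise(results_a, results_b):
    """Merge both result lists into one list of ImageRecord objects."""
    return [from_repo_a(r) for r in results_a] + [from_repo_b(r) for r in results_b]
```

Once every source is reduced to the same record, the rest of a program can search, rank and attribute images without caring where they came from.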

To overcome the difficulties of decentralised image storage, we decided to build our own database. Our goals are to create a database of the best open content and public domain images, to refine the metadata, to add new linkages and context data to the images, and to offer an API for easy and fast access to the images and their metadata.
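As a rough illustration of what such a database could look like, the sketch below uses Python's built-in sqlite3 module. The schema, the per-tag weight and all function names are our assumptions for the example, not the project's actual design. It shows the two things the remote repositories do not allow: adjusting how well a tag fits an image, and letting that adjustment change the order of search results.

```python
import sqlite3


def create_db():
    """Create an in-memory database with images and weighted tags."""
    db = sqlite3.connect(":memory:")
    db.executescript(
        """
        CREATE TABLE images (
            id      INTEGER PRIMARY KEY,
            url     TEXT NOT NULL,
            author  TEXT NOT NULL,
            license TEXT NOT NULL
        );
        CREATE TABLE tags (
            image_id INTEGER NOT NULL REFERENCES images(id),
            tag      TEXT NOT NULL,
            weight   REAL NOT NULL DEFAULT 1.0,
            PRIMARY KEY (image_id, tag)
        );
        """
    )
    return db


def add_image(db, url, author, license_name, tags):
    """Store one image with its attribution data and lower-cased tags."""
    cur = db.execute(
        "INSERT INTO images (url, author, license) VALUES (?, ?, ?)",
        (url, author, license_name),
    )
    image_id = cur.lastrowid
    db.executemany(
        "INSERT INTO tags (image_id, tag) VALUES (?, ?)",
        [(image_id, t) for t in {t.lower() for t in tags}],
    )
    return image_id


def adjust_tag(db, image_id, tag, delta):
    """Raise or lower how strongly a tag describes an image."""
    db.execute(
        "UPDATE tags SET weight = weight + ? WHERE image_id = ? AND tag = ?",
        (delta, image_id, tag.lower()),
    )


def search(db, tag, limit=10):
    """Return (id, url, author, license) rows, best-weighted first."""
    return db.execute(
        """
        SELECT i.id, i.url, i.author, i.license
        FROM images AS i JOIN tags AS t ON t.image_id = i.id
        WHERE t.tag = ?
        ORDER BY t.weight DESC, i.id ASC
        LIMIT ?
        """,
        (tag.lower(), limit),
    ).fetchall()
```

Keeping author and license next to each image means every search result already carries what is needed for attribution.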

From images to videos

The Google Summer of Code program provides students with a stipend to work on an open source project together with a mentor organization. With the help of the Google funding we developed our first tool, “AudioImager”, which automates video creation using open content images.

AudioImager helps users create videos by combining audio and Open Content images. Users enter keywords that describe the audio track for which they want to create a video. The system then retrieves Creative Commons licensed images from Flickr that match those keywords. It also provides a graphical interface where the user can adjust the duration of each image, preview the video and search for a different image if the proposed one is not satisfactory. Finally, AudioImager renders the video, which can then be published online. The software also creates end credits in which the photos’ rights owners are credited. It therefore helps to reduce the copyright violations commonly committed by amateur video authors.
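Two of the steps above, spreading the images over the length of the audio track and generating the end credits, can be sketched in a few lines. This is an illustration under our own assumptions, not AudioImager's actual code: the Photo fields, the equal-share rule for durations and the wording of the credit line are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Photo:
    title: str
    author: str
    license: str
    url: str


def plan_slideshow(audio_seconds, photos, overrides=None):
    """Give every photo a duration so the total matches the audio length.

    overrides maps a photo index to a duration fixed by the user; the
    remaining time is shared equally among the other photos.
    """
    overrides = overrides or {}
    fixed = sum(overrides.values())
    free = [i for i in range(len(photos)) if i not in overrides]
    if fixed > audio_seconds or (not free and fixed != audio_seconds):
        raise ValueError("user-set durations do not fit the audio track")
    share = (audio_seconds - fixed) / len(free) if free else 0.0
    return [overrides.get(i, share) for i in range(len(photos))]


def end_credits(photos):
    """Build one attribution line per distinct photo, in order of appearance."""
    lines, seen = [], set()
    for p in photos:
        if p.url in seen:
            continue  # a photo shown twice is credited once
        seen.add(p.url)
        lines.append(f'"{p.title}" by {p.author}, {p.license}, {p.url}')
    return lines
```

For example, with a 60-second track, three photos and the first photo pinned to 10 seconds by the user, the other two photos would each receive 25 seconds.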

Video creation is generally perceived as a challenging task. Our goal was to present the task in a new way, with image discovery and retrieval embedded into the storytelling and video creation process. We also wanted to encourage the use of legally shared material and to automate the cumbersome process of attributing images.

Our next step in the project is to replace Flickr with our own image database. Though Flickr’s photos are great for systems like AudioImager, the service falls short of some of our requirements, such as the speed of image and metadata access. The tags we find on Flickr images also leave room for improvement, and in our own database we can improve them using different techniques. Another important target is to make our database aware of context. It could be tuned with every image search and so return more accurate results that suit the audio best. With our own database we could therefore provide more satisfactory results than we can with Flickr. We also used WordNet ontologies to feed our database, and with that we could provide more meaningful images to the user.
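One way an ontology could make results more meaningful is by widening a search from the typed keyword to related terms. The sketch below only illustrates that idea: the RELATED table is a tiny hand-written stand-in for a WordNet lookup, and the weights and function names are assumptions, not a description of how our database uses WordNet.

```python
# Toy stand-in for a WordNet-style lookup; a real system would query WordNet.
RELATED = {
    "sea": {"ocean", "water"},
    "boat": {"ship", "vessel"},
}


def expand(keywords):
    """Map each term to a weight: 1.0 for typed keywords, 0.5 for related terms."""
    weights = {}
    for kw in (k.lower() for k in keywords):
        weights[kw] = 1.0
        for rel in RELATED.get(kw, ()):
            # Never downgrade a term the user typed explicitly.
            weights.setdefault(rel, 0.5)
    return weights


def rank(images, keywords):
    """Order (name, tags) pairs by summed weight of matching tags; drop non-matches."""
    weights = expand(keywords)
    scored = []
    for name, tags in images:
        score = sum(weights.get(t, 0.0) for t in {t.lower() for t in tags})
        if score > 0:
            scored.append((score, name))
    # Highest score first; ties are broken alphabetically by name.
    scored.sort(key=lambda s: (-s[0], s[1]))
    return [name for _, name in scored]
```

With this table, a search for "sea" would also surface an image tagged only "ocean", ranked below images that carry the typed keyword itself.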

We hope to support a new kind of creativity that can connect professional “all rights reserved” material with Open Content. So if you have applications that could benefit from the database, let us know. And stay tuned, this is just the beginning.

Screencast of the AudioImager prototype
