In today’s blog post, I interview PyImageSearch Gurus member, Tuomo Hiippala, who was recently awarded a 28000€ grant (approximately $30,500 USD) to research how computer vision can be used to study visual culture, including social media images and photo archives. We’ll discuss his very unique area of research, which blends linguistics with computer vision, and how his work is shaping modern day visual data analysis.
Tuomo has been a longtime reader of the PyImageSearch blog and was even one of the very first backers of the PyImageSearch Gurus Kickstarter campaign. It’s an honor to have him here today and it’s only fitting that he’s the first person I’ve interviewed on the blog. There is a lot of knowledge to be gained from this interview with Tuomo — especially how computer vision can be applied to real-world problems.
Ideally, I hope these types of spotlight and interview posts can be more frequent on the PyImageSearch blog in the future. While I try to post new tips, tricks, and tutorials every week, what’s really important is how you, the reader, take this knowledge and apply it to your own projects.
If the PyImageSearch blog, Practical Python and OpenCV, or the PyImageSearch Gurus course have helped you along your computer vision journey, please leave a comment on this blog post or send me message. Like I said, I would love to do more of these interview posts in the future!
PyImageSearch Gurus member spotlight: Tuomo Hiippala
Adrian: Hey Tuomo! Thanks for agreeing to do this interview. It’s great to have you on the PyImageSearch blog.
Tuomo: As a long-time reader: my pleasure!
Adrian: Let’s go ahead and kick off this interview. Where do you work/what is your job?
Tuomo: I currently work as a post-doctoral researcher at the Centre for Applied Language Studies, University of Jyväskylä, all the way up in Finland. As for my background, I got a PhD in English philology from the University of Helsinki, specializing in linguistics. I’m a special kind of linguist though, because my interests extend way beyond language.
Basically, I do research on how language interacts with other modes of communication, such as photographs, diagrams, infographics … you name it. Visiting a website or picking up the daily newspaper quickly makes one realize why it makes sense to examine all these different modes together. They are closely coordinated and interact with each other all the time – it’s an inherent feature of communication, but curiously enough, something that has only recently really caught on with researchers from different fields.
Adrian: How did you first become interested in computer vision?
Tuomo: I had to check my notebook for this: I wrote down something during the 2014 Frankfurt Book Fair in Germany … I remember vaguely checking out some of the exhibitioners at the trade fair, who advertised automatic document management solutions. That’s when the idea really hit me, although I was obviously already familiar with techniques such as Optical Character Recognition, which pretty much revolutionized linguistics back in the 1990s, removing the logjam of manual input. Anyhow, that’s when I decided to learn more about computer vision and get to know Python on the side.
Adrian: How do you use computer vision in your job?
Tuomo: I’ve got a few projects that I’m working on at the moment: the first one involves developing a semi-automatic document analysis system, which uses OpenCV, Tesseract and NLTK to examine document images and annotates their features into an XML database. Commercial OCR software is not really useful for this purpose, because it is mainly concerned with extracting the content. For studying how the different modes of communication work together, we need to know much more: the content, its placement in layout, logical organisation and semantic interrelations – and this just gives us a minimal view of a document and its inner workings. I hope to present my ongoing work at the Association for Computational Linguistics annual meeting in Berlin next summer!
I’m also looking into how computer vision could be used to enable the researchers to get a stronger grip on large volumes of visual data. Many different fields in the humanities and social sciences are concerned with visual communication – yet many researchers tend to do visual analysis by hand, which really does not allow them to seriously engage with the volumes of data typically found in historical photo archives or social media services such as Instagram. There is a huge potential in open source libraries such as OpenCV and other tools that make programming more accessible – IPython comes to mind as a good candidate for developing and sharing such tools. This is what I’d like to contribute to, focusing on the applications and leaving the development of algorithms to computer scientists. I nevertheless try to learn about the algorithms as well!
Adrian: You mentioned that you applied for a grant. Can you tell me more about that?
Tuomo: Sure! Finnish Cultural Foundation just awarded me a personal grant worth 28 000€ (approximately $35,000 USD) to do research on how computer vision can be used to study visual culture, such as social media images and photo archives. I also got a 3500€ grant from the City of Helsinki in 2015 to study the photographs shared by the tourists visiting Helsinki, that is, how they see the city, which is another long-term research interest of mine. I’m currently waiting for the beginning of the tourist season to start collecting data!
Note: Interested in learning more about Tuomo’s research? Be sure to give these two pages a read:
- https://www.csc.fi/web/atcsc/-/humanisti-datailee
- http://www.inthefieldstories.net/a-humanist-take-on-scientific-computing/
Adrian: How did the PyImageSearch/PyImageSearch Gurus help you in obtaining the grant?
Tuomo: What can I say? The blog and the course have benefitted my work immensely: it has opened up a whole new area of research for me. Although I’ve been learning computer vision for one and half years only, I’m already confident enough to apply the techniques in research. And this is obviously essential for acquiring any funding at all.
Adrian: Would you recommend the PyImageSearch Gurus course to other developers and students trying to learn computer vision?
Tuomo: Absolutely – I think the course is of extremely high quality and worth every cent – I think I was among the first five people to back it on Kickstarter! What I find particularly appealing is that the course materials are extremely well-written and presented in a way that allows one to learn Python at the same time. They go nicely hand-in-hand, as do the course and the surrounding community!
Adrian: Anything else you would like to share?
Tuomo: I would certainly recommend learning about computer vision (and machine learning) to anyone, regardless of their professional or academic interests. I think the variety of projects that have been showcased inside the PyImageSearch Gurus course nicely illustrates the wealth of applications that computer vision can and will have in the future. And the best part is, most of these techniques are really accessible to a layperson, thanks to Python and the wealth of quality tutorials. Just dive in, it will definitely pay off!
Adrian: If a PyImageSearch reader wants to chat, where is the best place to connect with you?
Tuomo: You can find me on Twitter (http://twitter.com/tuomo_h/), LinkedIn (https://fi.linkedin.com/in/tuomohiippala) and GitHub (https://github.com/thiippal). Feel free to get in touch!
What's next? We recommend PyImageSearch University.
86 total classes • 115+ hours of on-demand code walkthrough videos • Last updated: October 2024
★★★★★ 4.84 (128 Ratings) • 16,000+ Students Enrolled
I strongly believe that if you had the right teacher you could master computer vision and deep learning.
Do you think learning computer vision and deep learning has to be time-consuming, overwhelming, and complicated? Or has to involve complex mathematics and equations? Or requires a degree in computer science?
That’s not the case.
All you need to master computer vision and deep learning is for someone to explain things to you in simple, intuitive terms. And that’s exactly what I do. My mission is to change education and how complex Artificial Intelligence topics are taught.
If you're serious about learning computer vision, your next stop should be PyImageSearch University, the most comprehensive computer vision, deep learning, and OpenCV course online today. Here you’ll learn how to successfully and confidently apply computer vision to your work, research, and projects. Join me in computer vision mastery.
Inside PyImageSearch University you'll find:
- ✓ 86 courses on essential computer vision, deep learning, and OpenCV topics
- ✓ 86 Certificates of Completion
- ✓ 115+ hours of on-demand video
- ✓ Brand new courses released regularly, ensuring you can keep up with state-of-the-art techniques
- ✓ Pre-configured Jupyter Notebooks in Google Colab
- ✓ Run all code examples in your web browser — works on Windows, macOS, and Linux (no dev environment configuration required!)
- ✓ Access to centralized code repos for all 540+ tutorials on PyImageSearch
- ✓ Easy one-click downloads for code, datasets, pre-trained models, etc.
- ✓ Access on mobile, laptop, desktop, etc.
Summary
In today’s post, I interviewed Tuomo Hiippala, who works for the University of Jyväskylä. His research blends two unique areas of study: linguistics and computer vision, where his work focuses on enabling researchers to better understand large volumes of visual data.
Ideally, I would love to do more PyImageSearch reader interviews and spotlights in future blog posts. If the PyImageSearch blog, Practical Python and OpenCV, or the PyImageSearch Gurus course has helped you along your computer vision journey, please comment on this post or send me a message and let me know!
Join the PyImageSearch Newsletter and Grab My FREE 17-page Resource Guide PDF
Enter your email address below to join the PyImageSearch Newsletter and download my FREE 17-page Resource Guide PDF on Computer Vision, OpenCV, and Deep Learning.
arjun
Congrats Tuomo on your grant… very cool application you’re working on.
I have an unrelated question that I’d appreciate your help on. Do either of you have guidance for how to port a python application that uses computer vision libraries like openCV to run on iOS?
Do you use a bridge like PyObjC or Kiva, or just re-write in ObjectiveC or Swift? Thank you.
Adrian Rosebrock
It depends on your actual application. Whenever appropriate, I suggest wrapping your computer vision app as an API and then accessing it via your programming language of choice (and can afford to send the image over the network). However, this is only suitable for non-realtime applications. In the case where you need real-time performance, you should implement in Objective-C or Swift.
If you’re interested in the former approach (using the computer vision API), take a look at PyImageSearch Gurus which has an entire module dedicated to using PhoneGap/Cordova to (i.e., JavaScript + HTML) to build a mobile computer vision application by calling a web API.
Kent
This is amazing ! When will the Gurus open again ? XD
Adrian Rosebrock
Right now the PyImageSearch Gurus course is set to open every 2-4 weeks. This might change in the future though.