How we use open source deep learning models

Discover Raffle's deep learning framework and models to create our unique AI algorithms that are the core of our products.

Article

Reading time:

By

Raffle

TABLE OF CONTENTS

This is the third post in our ‘The science behind the Raffle-lution’ series.

We recommend you read part one and part two first.

AI powered by deep learning has seen a lot of progress in recent years. This would not have been possible without a strong open-source AI API movement.

The two main open-source deep learning frameworks, PyTorch and TensorFlow, are open-source. Interestingly, Facebook is a strong backer of PyTorch and TensorFlow is a Google project.

Rather than a long discussion of why Big Tech might back open-source and open research, this blog post will look at some of the frameworks and models that are available in natural language processing (NLP) and how they can be used in products.

Frameworks

An open-source deep learning framework is a library that has the necessary elements to train deep learning models, so we need to be able to load and manipulate data, and define, train and deploy models.

At Raffle, we currently favor PyTorch because it is a “Pythonic” (= similar to Python) programming language. TensorFlow has some advantages in terms of putting models into production.

Keras is another library that makes it easier to get started with deep learning. But be warned, becoming a deep learning master is difficult.

Open-source NLP models

Raffle built NLP AI to make it easier for employees to find company information. So we need models that can understand natural language questions and connect the question with knowledge bases to deliver an appropriate answer.

Speech-based interfaces (speech2text and text2speech) and understanding all languages are also highly desirable. So what is available today?

Language modeling. As discussed in the previous post, we can build AI that understands natural text better using language models like BERT as the foundation. Hugging Face is a company that specializes in distributing code and trained models for NLP.
Machine translation will eventually break down language barriers. Recently, Facebook AI open-sourced a machine translation system that translates between 100 languages.
Speech recognition and speech synthesis are already built into your smartphone but are also available as open source so it can be integrated into products.

The Babel fish brought to life?

Having access to machine translation can allow a user to pose a question in a different language than the model is trained on by:

detecting the language
translating
inputting the translated question into our system to get an answer
translating the answer back to the user's language.

Tools like Google Translate are already sophisticated

Speech recognition opens up the possibility for Raffle Site Search to run in voice-based customer service. The size of the models currently stops us from implementing these solutions. For example, the 100-to-100 machine translation model has 15 billion parameters! We simply cannot run such a big model.

Luckily, thanks to open-source deep learning contributors, we know we will soon get a much smaller model that we can run. So one day in the not-so-distant future, Raffle can answer your questions in many languages — it looks like the Hitchhiker’s Guide to the Galaxy got it right after all!

On top of that, there are infrastructure tools that help us improve models:

A/B test - for testing alternative models in production
Bayesian optimization - for automated search over model architectures
Automatic retraining - to continuously train and improve our production models
Active learning schemes - to pinpoint what data we should label next to improve the most

Staying state-of-the-art

Open-source deep learning AI frameworks have helped to enable Raffle's unique AI technology.

We collaborate closely with the Technical University of Denmark and the University of Copenhagen to stay on top of the latest developments in research. This serves as an important inspiration for our work.

It is still in the early days, but we strongly believe that the technology we are developing will improve quickly and find uses far beyond the current products.

In our final post in this series, we will look at recent trends in NLP AI research to see what is just around the corner.

Read about our user cases here.

An AI search engine trained on YOUR content.

More from the Newsroom

Blog

April 10, 2024

What is Site Search and Why You Need It on Your Website