Build Intelligent Mobile Apps with Machine Learning

Machine learning (ML) is not a new field, but it has been evolving over time. Nowadays, it is not only one of the most important field of AI, but also needed for everything. ML has been helping more and more to figure out issues through reliable predictions and efficient results. That means it has been changing the way of people live around the world.

Fortunately, there are tools which are integrated with Machine Learning, such as Azure ML, which is a service hosted in the Microsoft cloud. TensorFlow is an open source library of Google that has predictive models incorporated, Amazon AI, and so on. Furthermore, it is easier to use ML for many kinds of projects.

On the other hand, as the mobile market has been advancing as well. There are frameworks that make it easy to use Machine Learning, especially with mobile apps. We will focus on the advantages and possibilities offered by these tools.

This article will provide guidelines, information, and steps to start, experiment, and develop mobile applications with new tools that are integrated with Machine Learning. This tool is the ML Kit, which is an API that works with models. These models find human faces and track positions of facial landmarks in photos, videos, or live streams and provide information about the state of facial features. If you are going to create ML, it helps to train models. Core ML can be used to easily integrate machine learning models into an app and TensorFlow Lite helps integrate pre-trained models into mobile apps as well.

What is Machine Learning?

Machine Learning is a branch of Artificial Intelligence. It develops techniques that allow computers to learn. This learning is possible thanks to the detection of patterns within a set of data, so it is the program that predicts by itself what situations could happen or not. These calculations allow the machine to learn in order to generate reliable decisions and results.

What is Machine Learning for?

Machine learning opens up opportunities for creating new and engaging experiences. It has a lot of practical applications that drive real business results, such as time and money savings, that have the potential to dramatically impact the future of an organization.

Common Machine Learning Use Cases

Here are some well-known examples of machine learning which are already a part of our lives or on the way to be one.

Google’s self-driving cars
Detecting credit card fraud
Facial recognition in Google Photos and Facebook
Apple Siri, Google Now, Amazon Echo and Windows Cortana
Email providers using machine learning to detect and handle spam emails
The recommendation engines used by Amazon and Netflix to show you items and movies based on your previous site interaction
Face ID to unlock devices.

Why do inference on smartphones?

There are many reasons to argue about why using Machine Learning in our mobile applications is much better. Here there are some of them:

Data privacy
Free computing power
Always available (offline and online)
Optimized (CPU vs GPU) for device performance
Minimizes memory footprint
Minimizes power consumption
Real-time use cases
No latency and fast execution

Introducing ML Kit

Face Detection

The Face API finds human faces in photos, videos, or live streams. It also finds and tracks positions of facial landmarks such as the eyes, nose, and mouth. With these technologies, you can edit photos and video, enhance video feeds with effects and decorations, create hands-free controls for games and apps, or react when a person winks or smiles and so on.

Face recognition

This function automatically determines if two faces are likely to correspond to the same person. However, this API only provides functionality for face detection and not face recognition for the time being.

Face tracking

This aspect extends face detection to video sequences. Any face appearing in a video for any length of time can be tracked. That is, faces that are detected in consecutive video frames can be identified as being the same person. Note that this is not a form of face recognition. This mechanism just makes inferences based on the position and motion of the face(s) in a video sequence.

Barcode API

The Barcode API detects barcodes in real-time, on any device, in any orientation. It can also detect and parse several 1D and 2D barcodes in different formats at the same time.

Text Recognition API

The Text Recognition API recognizes text in a lot of languages. It also represents the structure of recognized text, including paragraphs and lines. It can automate tedious data entry for credit cards, receipts, and business cards, as well as help organize photos, translate documents, or increase accessibility.

TensorFlow Lite to use models in mobile apps

This framework helps pre-trained models run in mobile apps. For instance, it can convert a custom trained model to the TensorFlow Lite file format (.tflite) using the TensorFlow Lite Converter. Then you can use that converted file in your mobile application.

Introducing Create ML and Core ML

Create ML is a framework that creates and trains custom machine learning models on Macs. It is important to mention that Create ML was created this year in the annual event of developers WWDC. For instance, you can train a model to recognize or classify fish by showing it lots of images of different fishes. When the model is performing well enough, you’re ready to integrate it into your app using Core ML.

Otherwise, Core ML is another framework, which is used to integrate machine learning models into an app easily (macOS, iOS, watchOS and tvOS). Some of its features are: Deep Neural Networks, Recurrent Neural Networks, Support Vector Machines, Tree Ensembles, Linear Models and so on. In the Image 1, we can see the integration of a model into an app.

A trained model is the result of applying a machine learning algorithm to a set of training data. The model makes predictions based on new input data. For instance, a model that’s been trained on a region’s historical house prices may be able to predict a house’s price when given the number of bedrooms and bathrooms. Core ML is optimized for on-device performance, which minimizes memory footprint and power consumption. Running strictly on the device ensures the privacy of user data and guarantees that your app remains functional and responsive when a network connection is unavailable. Among some types of models that Core ML has:

Sentiment Analysis
Handwriting Recognition
Translation
Scene classification
Style transfer
Music Tagging
Predicting text

As you can see in the Image 2, it supports Vision for image analysis, Foundation for natural language processing and GameplayKit for evaluating learned decision trees. Core ML itself builds on top of low-level primitives like Accelerate and BNNS, as well as Metal Performance Shaders.

Vision

Vision is a new and powerful framework that provides solutions to computer vision challenges through a consistent interface. First of all, we have to understand how to use the Vision API to detect faces, compute facial landmarks, track objects, and more. It takes things even further by providing custom machine learning models for Vision tasks using CoreML.

Face Detection and Recognition

This detects face or facial-features (such as the eyes and mouth and so on) in an image. On the image 3 there are 9 faces detected.

Barcode Detection

It finds and recognizes barcodes in an image through an image analysis request. It detects information as well.

Text Detection

This function finds regions of visible text in an image through an image analysis request. It detects information about regions of text detected as well.

Object Detection and Tracking

It tracks movement of a previously identified rectangular object across multiple images or video frames. It also provides the position and extent of a detected image feature.

Natural Language API (NSLinguisticTagger)

This tool provides a uniform interface to a variety of natural language processing functionality with support for many different languages and scripts. It can be used to segment natural language text into paragraphs, sentences, or words, and tag information about those tokens, such as part of speech, lexical class, lemma, script, and language.

GameplayKit

This is an object-oriented framework that provides foundational tools and technologies for building games.It architects and organizes a game logic. It incorporates common gameplay behaviors such as random number generation, artificial intelligence, pathfinding, and agent behavior.

Where do I get the models?

Build your apps with the ready-to-use CoreML models below, or use CoreML Tools to easily convert custom models into the CoreML format.

Apple models

There are some models which are ready to use.

MobileNet
SqueezeNet
Places205-GoogLeNet
ResNet50
Inception v3
VGG16

Custom Models of machine learning packages

Fortunately, Apple has developed a tool that converts models to Core ML format. At the moment, these models have to follow the next library formats:

Caffe
Keras
libSVM
scikit-learn
XGBoost

Use Core ML Tools to convert trained models to Core ML

CoreML Tools is a python package that can be used to convert models from machine learning toolboxes into the CoreML format. In particular, it can be used to:

Convert existing models to .mlmodel format from popular machine learning tools including Keras, Caffe, scikit-learn, libsvm, and XGBoost.
Express models in .mlmodel format through a simple API.
Make predictions with an .mlmodel (on select platform for testing purposes).

## Download and install python package
> pip install coremltools

caffe_model = ('flowers.caffemodel', 'flowers.prototxt')
model = coremltools.converters.caffe.convert( caffe_model,
image_input_names = 'data', class_labels = 'labels.txt')
model.save('FlowerClassifier.mlmodels')

Integrating a Core ML Model into an app

This tool lets you integrate a broad variety of machine learning model types into your app. In addition to supporting extensive deep learning with over 30 layer types, it also supports standard models such as tree ensembles, SVMs, and generalized linear models. First of all, there are some requirements that should be considered before beginning:

iOS 11.0+ Beta
Xcode 9.0 Beta
Swift 3.0+

As you can see in Image 5, the model just has to be dragged into the Xcode project. The model should be instantiated as object, then the function prediction should be called. For instance, the input is an image and the output is a string.

let flowerModel = FlowerClassifier() if let prediction = try?flowerModel.prediction(flowerImage: image)return
prediction.flowerType

Conclusion

There are just a few steps to an easy integration of machine learning models, which enables you to build apps with intelligent new features using just a few lines of code. You just have to try it.

Currently, Machine Learning is helping, not only to figure out solutions for difficult problems, but also to change the way of people are living worldwide. Therefore, it is important to be updated and to take advantage of tools or libraries that already offer Machine Learning services. As everybody knows, the mobile market has been growing as well, so there are frameworks with Machine Learning that can be incorporated into mobile apps, and they can work online or offline.