By Prem Nawaz Khan / @mpnkhan
Statistics
References: World Health Organization Report , WRD Report
Smart phones & Smart speakers - best examples of reviving AI
Apple's Siri, Amazon echo, Google Assistant, Cortana for Windows
Pros & Cons
It is about educating Content editors to input alternate text in their Content Management System
(and/or)Educating Web Developers to put alt text for images
April 2017, Automatic Alt text extension for Chrome
Based on Tensor flow and im2txt Model
April 2018, Around 25 API's made available by Microsoft for public use
Using Facial recognition, we can
Recognizing Humans versus Bots
CAPTCHA sucks?
Examples like iPhone X Facial Unlock, Windows Hello
Automated captions using Lip Reading
Oxford University researchers partnered with Google on a new AI tool that reads lips, and the results were significant.
Trained with a dataset of more than 100,000 natural sentences.
Helpicto is an android app which uses speech to text and Microsoft Cognitive API to convert speech a set of images which students with Language disorders related to autism, dysphasia, or Alzheimer’s disease
Cognitive API uses AI to split the sentences and sends back the intents which is converted to a list of images
The speech command is “Do you want to eat an apple?” Helpicto will then generate three images: the child himself, the action of eating, and a picture of an apple.
Real Time American Sign Language Video Captioning using Deep Neural Networks
More info: Slides and NVIDIA BlogAs a Front End Engineer
Make use of WebRTC in detecting Eye, Hands, Face
Use Web Speech API
function synthVoice(text) {
const synth = window.speechSynthesis;
const utterance = new SpeechSynthesisUtterance();
utterance.text = text;
synth.speak(utterance);
}
SpeechRecognition API
const SpeechRecognition = window.SpeechRecognition || window.webkitSpeechRecognition;
const recognition = new SpeechRecognition();
As a Back End Engineer
Open source Machine Learning Frameworks
Google's Tensor flow
Microsoft Cognitive Toolkit (CNTK)
Theano
MATLAB,
Caffe and Caffe2,
Keras,
Pytorch,
MXNet,
Deeplearning4j,
NVIDIA Caffe,
Chainer,
PaddlePaddle