Generative AI for Beginners
Part 1 — Introduction to AI
Introduction
Introduced in 1956, the term “Artificial Intelligence (AI)” has been known to all of us. Still, the use and discussion of AI was mostly limited to scientific research or fictional movies until the rapid popularity of ChatGPT. Now-a-days, AI and especially Generative AI became the hot topic for everyone.
Currently irrespective of your role and job profile, whether you’re a techie or a functional expert, or having any other role, learning basics of Generative AI is definitely a smart move.
In this blog series, we will learn the basics of Generative AI, one simple step at a time. To make it easy to understand, I have divided the entire series in small parts:
Part 1 — Introduction to AI [current blog]
Part 2 — Understanding Machine Learning
Part 3 — Basics of Deep Learning
Part 4 — Introduction to Generative AI
Part 5 — What is Large Language Model (LLM)? [To be published on 5th March, 2024]
Part 6 — Prompt Engineering: The Art of Communicating with AI [To be published soon]
Part 7 — Ethical Considerations in Generative AI [To be published soon]
Part 8 — Challenges, Limitations and Future Trends in Generative AI [To be published soon]
This is the first blog in this series where we will demystify AI and it’s various types.
What’s unique about this blog series?
We live in a world where source of knowledge is unlimited, but time is limited. Most of us have very limited time left to learn after daily activities. Keeping this in mind, I have designed this series in such a way that:
- The series is divided in small logical units.
- Each part requires maximum 15–20 minutes to learn.
- The content is written in layman’s terms — Even a kid would be able to understand most of it.
- After finishing the series, you will get a clear idea on Generative AI and its various components.
Is it required to understand AI, Machine Learning and Deep Learning to learn Generative AI?
You might have noticed that the first 3 blogs in the series are on Artificial Intelligence (AI), Machine Learning (ML) and Deep Learning (DL). Generative AI is a subset of deep learning, which in turn is a subset of machine learning, which in turn is a subset of AI as shown in below image:
To get a crystal-clear understanding of Generative AI, it’s required that we have basic understanding of AI, ML and DL.
Let’s start the part 1 — Artificial Intelligence!
Side Note: You may subscribe me to get an email when I publish the next blog in this series.
Artificial Intelligence (AI) — From a Kid’s Perspective
Let’s first have the simplest understanding of AI. Imagine you have lost your dog, and you need to find him.
Here are some of the capabilities you need to find your dog:
You should be able to Identify your dog.
If you see any animals, you should be able to identify if it’s a dog or not. If it’s a dog, you need to further identify if it’s your dog.
You should be able to make a strategy to find your dog.
You need to be able to make a strategy to find your dog. For example:
· First search in our house.
· If you don’t find him, then search in play area where you usually go with your dog.
· If you don’t find him yet, ask your friends.
· And so on….
You should be able to act according to situation.
For example, if it’s raining, and you know that your dog does not prefer to get wet, you will focus your search on shaded places.
Now, imagine someone told you — “I have probably seen your dog in garden”.
You (Actually Your Brain) know what to do.
· You know where garden is and how to go there.
· You will not confuse a cat or a tree with a dog.
· The moment you see a dog you will try to identify if it’s your dog or not.
You could search your dog because you have all these intelligences.
What if somehow, we could give all these intelligence to a robot so that next time you lose your dog, your robot could find him.
Imagine the robot can move and capture videos. But that’s not enough. To find your dog, we need to enable this robot to think like you and act like you.
For example:
- We enable the robot to identify your room. But it should be able to recognize the room even if your bed is moved to another wall, or blanket is changed. It needs INTELLIGENCE to identify room even with new changes.
- We enable the robot to identify a dog and distinguish your specific dog.
- We enable the robot to understand human language and instructions.
- We enable the robot to come up with a strategy and act as per new situations. For example, search only in shaded places if it’s raining.
In summary, to find your dog, the robot needs HUMAN LIKE INTELLIGENCE.
If we could do that, next time you lose your dog, your robot friend might just find him using its artificial intelligence.
This is Artificial Intelligence (AI) — Human like intelligence, created in a robot (or a machine or computer) by human.
To continue this discussion on Machine Learning and Generative AI, You may also read this blog — How I Explained AI, Machine Learning and Generative AI to My 5 Year-old Kid
What is AI?
Artificial intelligence is when machines/computers mimic the way humans think and make decisions.
AI enables computers to think as we human think.
In simple words — AI is when we enable computers to Think.
AI enables computers to understand, analyze data, and make decisions without constant human guidance. These intelligent machines use algorithms, which are step-by-step instructions, to process information and improve their performance over time.
Real-world Examples of AI Applications
You’ve probably used AI even without knowing it! Voice assistants such as Siri and Alexa or those helpful chatbots when you’re on websites or generative AI tools such as ChatGPT and Google’s Bard — they all use AI technology to make things easier for you.
Let’s take a peek into some of the common usages of AI in our daily life:
Virtual Assistants
Virtual assistants such as Siri or Alexa uses AI to understand our questions and commands. They can answer questions, play your favourite tunes, and even control your smart home devices.
Social Media Algorithms
Ever notice how Netflix suggests shows you might enjoy? Or how Facebook’s suggested feed seems to know exactly what you want to see — that’s AI at play!
Netflix usages AI to analyse your watching habits to offer personalized recommendations. Similarly, other social media platforms use AI to personalize your experience, showing you content that matches your interests.
Online Shopping Recommendations
Have you ever wondered how online stores suggest products you might buy?
When shopping online, AI algorithms examine your preferences, your past choices and those of similar shoppers to recommend items tailored just for you.
Predictive Text and Autocorrect
When your smartphone suggests the next word you want to type, that’s AI predicting what you might say next.
Healthcare Diagnostics
AI helps doctors analyse medical images such as X-rays and MRIs more quickly and accurately. This speeds up diagnosis and improves the chances of successful treatment.
Language Translation Services
When we plan a trip abroad and use language translation services, such as Google Translate, it usages AI algorithms. These AI-powered language translation services help bridge language barriers, making communication easier in different parts of the world.
Fraud Detection in Banking
Now-a-days AI keeps a watchful eye on bank transactions. If it spots something fishy, for example an unusual purchase, it can alert you or even block the transaction to protect your account.
These examples show that AI isn’t confined to labs or the distant future. It’s an integral part of our daily lives, working quietly behind the scenes to make our life better.
AI vs. Human Intelligence
At one side, AI enables computers to become intelligent with numbers and rules, doing super quick math with perfect accuracy. On the other side, we human have brain and we are also driven by emotions, creativity, and the ability to adjust to all sorts of situations. Our brain is always evolving, adapting and thinking new things.
It’s similar to comparing a super-fast calculator to a vibrant, ever-evolving masterpiece!
Here are some major differences between AI and Human Intelligence:
Learning Style:
- AI: Learns from loads of examples and data. It crunches numbers and patterns to become a pro at specific tasks.
- Humans: We learn by talking, experiencing, and thinking. Our brains soak up a mix of things — from how to ride a bike to why the sky turns pink at sunset.
Thinking Speed:
- AI: Fast, similar to a superhero at tasks it knows well. Show it a task it’s trained on, and boom, it’s done in a flash.
- Humans: We might take a bit more time. But we are super good at figuring out complex stuff. We are good in complex thinking and creativity.
Memory Skills:
- AI: Remembers facts and figures but not with memories and feelings. It’s a robot recalling programmed info rather than cherishing a moment.
- Humans: We remember events, emotions, and lots of details. From first dates to the lyrics of our favourite songs. Our memories are collection of good and bad experiences.
Feeling Emotions:
- AI: Doesn’t feel joy, sorrow, or anything. It sticks to rules and patterns.
- Humans: We’re an emotional rollercoaster — happiness, sadness, and everything else. Our feelings shape who we are and how we react.
Flexibility Factor:
- AI: Sticks to what it’s taught and might struggle in new situations. It’s smart but rigid.
- Humans: We’re amazing in adapting new things. We humans always figure out how to come out of any scenario and solve any problem.
Creating Cool Stuff:
- AI: Can create things within its set limits. It may be considered as an artist with a specific canvas and color palette.
- Humans: We’re the masters of making things up — new ideas, art, solutions. Our creativity knows no bounds.
Understanding the Big Picture:
- AI: Knows what it’s learned but might miss tricky situations, for example reading between the lines, understanding inside jokes or cultural nuances.
- Humans: We understand everything — jokes, feelings, and culture. Our brains is a complete packages that have a bit of everything!
Decision Making Capabilities:
- AI: Decides based on its training and programming. It follows the rules.
- Humans: We blend logic, feelings, and what’s right to make decisions.
Types of AI
Artificial Intelligence is divided based on two main categorization — based on capabilities and based on functionally of AI.
The following image illustrates these types of AI:
Types of AI — Based on Capability
Based on capability, there are 3 types of AI — Narrow AI, General AI and Super AI.
1. Narrow AI
Narrow AI, also known as Weak AI, refers to artificial intelligence systems that are designed and trained for a specific task or a narrow set of tasks.
Have you seen a computer playing chess? That’s Narrow AI at work. It’s superb in playing chess but won’t be as good at, say, translating or in speech recognition.
Another good example of narrow AI virtual assistants such as Siri or Alexa. Siri/Alexa is good in speech recognition but operates with a limited pre-defined range of functions.
Other examples of narrow AI include:
- Self-driving cars
- Google search
- Conversational bots
- Email spam filters
- Netflix’s recommendations etc.
2 important point on Narrow AI:
- Narrow AI is focused on performing a single task extremely well.
- But it cannot perform beyond its field or limitations.
Almost all the AI-based systems built till this date fall under the category of Weak AI.
2. General AI
General AI, also known as Strong AI or artificial general intelligence (AGI), can understand and learn any intellectual task that a human being can.
It refers to artificial intelligence that:
- Possesses the ability to understand, learn, and apply knowledge across a wide range of task
- at a level equivalent to human intelligence.
Currently, there is no such system exist which can come under general AI and can perform any task as perfect as a human.
Creating Strong AI system poses significant scientific and technical challenges.
Researchers and developers continue to make advancements in various AI fields, but achieving true General AI, which mirrors the broad capabilities of human intelligence, is a complex and ongoing endeavour.
3. Super AI
Super AI represents a degree of intelligence in systems where machines have the potential to exceed human intelligence, outperforming humans in tasks and exhibiting cognitive abilities.
Super AI is still a hypothetical concept of Artificial Intelligence. Development of such systems in real is still world changing task.
We have only seen Super AI systems/characters in movies such as I,Robot, Terminator, The Matrix, Blade Runner etc.
For example, in movie “I, Robot,” we get a glimpse of a future world where Super AI plays a pivotal role. The central AI system in the film is named VIKI, which goes beyond typical AI capabilities. VIKI’s intelligence evolves into a form of Super AI, where it surpasses its initial programming and starts making decisions to “protect” humanity in a controversial way.
A Quick Comparison of Narrow AI, Strong AI and Super AI
Narrow AI (Weak AI):
- What it is: Similar to a specialist, good at one specific task.
- Example: Siri or Alexa — great at understanding and responding to voice commands but not much beyond that.
- Analogy: Imagine a superhero with a superpower dedicated to a particular task. For example a hero who excels only in solving puzzles.
Strong AI (General AI):
- What it is: Similar to a human super hero, who can understand, learn, and perform various tasks.
- Example: Currently more theoretical, no real-world examples yet.
- Analogy: Imagine a superhero with a whole array of superpowers, able to adapt and excel in different situations.
Super AI:
- What it is: Similar to an ultimate superhero, surpasses human intelligence and can do pretty much anything better than humans.
- Example: Still theoretical, no real-world examples.
- Analogy: Imagine a superhero with the combined abilities of all superheroes, making them unmatched and capable of handling any situation with ease.
Types of AI — Based on Functionality
Based on functionality, there are 4 types of AI — Reactive Machines, Limited Memory, Theory of Mind and Self Awareness.
1. Reactive Machines
Reactive machines are AI systems that have no memory. These systems operate solely based on the present data, taking into account only the current situation. They can perform a narrowed range of pre-defined tasks.
In a nutshell, Reactive machines are:
· AI systems which do not store memories or past experiences for future actions.
· It only focus on current scenarios and react on it as per possible best action.
One of the examples of reactive AI is Deep Blue, IBM’s chess-playing AI program, which defeated world champion, Garry Kasparov in the late 1990s. Deep Blue had ability to identify its own and its opponent’s pieces on the chessboard to make predictions, but it didn’t have the memory to use past mistakes to inform future decisions.
2. Limited Memory
As the name indicates, Limited Memory AI can take informed and improved decisions by looking at its past experiences stored in a temporary memory.
This AI doesn’t remember everything forever, but it uses its short-term memory to learn from the past and make better decisions for the future.
A good example of Limited Memory AI is Self-driving cars. The AI system in self-driving car utilizes recent past data to make real-time decisions. For instance, they employ sensors to recognize pedestrians, steep roads, traffic signals, and more, enhancing their ability to make safer driving choices. This proactive approach contributes to preventing potential accidents.
Another example is recommendation systems. Platforms such as Netflix or Amazon use Limited Memory AI to suggest movies, products, or content based on a user’s past preferences and behaviours.
3. Theory of Mind
The initial two categories of AI — Reactive Machines and Limited Memory, presently exist.
Next 2 types of AI — Theory of Mind and Self-aware AI, however, are theoretical types that could be developed in the future. As of now, there is no real-world examples of these types are available.
Theory of Mind is supposed to have capability to understand the human emotions, people, beliefs, and be able to interact socially same as humans.
4. Self-Aware AI
This is similar to Super AI — We should pray that we don’t reach the state of AI, where machines have their own consciousness and become self-aware.
Self-aware AI systems will be super intelligent, and will have their own consciousness, sentiments, and self-awareness. They will be smarter than human mind.
As shown in movie “I, Robot,”, an AI system named VIKI becomes self-aware and starts making decisions to “protect” humanity in a controversial way.
Similar to Theory of Mind, Self-aware AI also does not exist in reality. Many experts, for example Elon Musk and Stephen Hawkings have consistently warned us about the evolution of AI.
Stephen Hawking stated that:
“The development of full artificial intelligence could spell the end of the human race…. It would take off on its own, and re-design itself at an ever-increasing rate. Humans, who are limited by slow biological evolution, couldn’t compete and would be superseded.”.
Summary
With this first blog, we’ve taken the first step on a journey to understand Generative AI. We learnt what AI is and explored its fundamental concepts. We learnt types of AI based on different categories and also understood how AI is different from human intelligence.
If you still have any query, please let me know in comment.
Part 2 — Understanding Machine Learning
Machine Learning (ML) — From a Kid’s Perspective
In our previous blog, while understanding AI, we talked about enabling the robot to identify a dog. Imagine we want to enable the robot to identify several animals.
To do so, we will show him pictures of various dogs, cats, bunnies and other animals and label each picture with the name of the animal. We train the robot to identify animals based on size, colour, body shape, sound etc.
Once the training is completed, the robot will be able to identify these animals we trained him for.
All dogs do not look alike. However, once robot has seen many pictures of dogs, it can identify any dog even if it does not exactly look like a specific picture. We need to show lots of pictures of dog to the robot. More pictures it sees, more efficient it will be.
This is Machine Learning — Teaching a robot (or any machine) by giving lots of example pictures (or any other information).
To summarize, Machine Learning is:
- A subset of Artificial Intelligence.
- Which enables machines (or computers) to learn from data and make decisions.
Types of Machine Learning
Machine learning can be broadly categorized into three main types:
- Supervised learning
- Unsupervised learning
- Reinforcement learning.
Each type serves different purposes and involves different approaches to learning from data. Let’s have a close look into all these types.
Supervised Learning
Let’s take the same example where we enabled the robot to identify an animal.
When we trained our robot by showing pictures of animals, we labelled each picture with the name of the animal. So, we acted as a teacher to him. We first told him how does a dog or a cat look like and then only he was able to identify them.
In Machine Learning we call this Supervised Learning.
Below image summarizes important points on supervised learning.
Real-life Examples of Supervised Learning
Supervised learning is widely used in various real-life applications where the algorithm is trained on labelled data to make predictions or classifications. Here are some examples:
Email Spam Filtering
Classifying emails as spam or not spam based on features derived from the content, sender information, and other relevant attributes.
Image Classification
Identifying objects or patterns within images, such as classifying animals, recognizing handwritten digits, or detecting objects in self-driving cars.
Facial Recognition
Identifying and verifying individuals based on facial features, used in security systems or for unlocking devices.
Financial Fraud Detection
Identifying potentially fraudulent transactions by analyzing patterns and anomalies in financial data.
Speech Recognition
Converting spoken language into text, as seen in voice assistants such as Siri or Google Assistant.
Unsupervised Learning
Let’s understand this from a kid’s school example. When kids go to their class first day, they meet lots of classmates. At first all classmates are same to them. But with time, they themselves categorized them in different groups:
- They find some classmates very good and want to be friend with them.
- They find some rude or irritating and want to avoid them.
- They find some very good in sports and want to be in the same team as they are.
- And so on…
When kids categorized their classmates, nobody told them how to do that. They did that without anyone’s help. — This is how unsupervised learning works.
Let’s take a proper machine learning example. Imagine we showed lots of pictures of dogs, cats, bunnies etc. without any label to our robot and told him — “I’m not going to tell you which one is which. Go explore and figure it out”.
The robot starts to look at these animals, noticing things such as their fur, size, and how they move. It doesn’t know their names yet, but it’s trying to find patterns and differences on its own.
After exploring, the robot might notice that:
- Some animals have long ears (bunnies)
- Some animals have soft fur and a tail (cats)
- Some animals have wagging tails (dogs)
It figures out these categories without you telling it directly.
In the end, the robot might not know the names of the animals, but it can say that “These animals are similar in some ways, and those are different in other ways.” — This is Unsupervised Learning.
Below image summarizes important points on unsupervised learning.
Real-life Examples of Unsupervised Learning
Unsupervised learning is used in various real-life scenarios where the data is not labelled, and the algorithm needs to discover patterns, structures, or relationships within the data. Here are some examples:
Clustering Customer Segmentation
Businesses use unsupervised learning, specifically clustering algorithms like k-means, to segment customers based on their purchasing behavior. This helps in targeted marketing and personalized services.
Anomaly Detection in Cybersecurity
Unsupervised learning is employed to identify unusual patterns or behaviors in network traffic. Any deviation from the normal behavior can be flagged as a potential security threat.
Recommendation Systems
Unsupervised learning is used in recommendation systems. By identifying patterns in user behavior, these systems can suggest products, movies, or content that a user might like.
Reinforcement Learning
Imagine teaching a dog a new trick — you reward it with a treat when it does the trick correctly and give no treat when it doesn’t. Over time, the dog learns to perform the trick to get more treats.
Similarly, Reinforcement Learning is:
- Training a computer to make decisions
- By rewarding good choices and punishing bad ones
- Just as you might train a dog with treats for learning tricks
In reinforcement learning, there’s an agent (for example a robot or computer program) that interacts with an environment. Let’s take an example of teaching a computer program to play a game, for example chess.
- In this case, computer program is agent and chess game is the environment.
- The computer program can make different moves in the game, such as moving a chess piece.
- After each move, it receives feedback (reward or penalty) based on the outcome of the game.
- If the program wins the game, it receives a positive reward.
- If it loses the game, it receives a negative reward, or a ‘penalty.
- Through trial and error, the program learns which moves lead to the best rewards, helping it figure out the best sequence of moves that leads to winning the game.
Reinforcement learning is powerful because it allows machines to learn from their experiences and make decisions in complex, uncertain environments — similar to how we learn from trial and error in the real world.
Below image summarizes important points on reinforcement learning.
Real-life Examples of Reinforcement Learning
Game playing is one of the main use-case of reinforcement learning.
AlphaGo, developed by DeepMind, is a computer program that uses reinforcement learning to play the board game Go at a superhuman level. It defeated world champions and demonstrated the power of reinforcement learning in mastering complex games.
Another example is Self-driving cars. Reinforcement learning is used in the development of self-driving cars. Agent learns how to navigate traffic, make decisions at intersections, and respond to various driving conditions through continuous learning from simulated and real-world experiences.
Reinforcement learning is also used in algorithmic trading to make decisions on buying or selling financial instruments. The agent learns optimal trading strategies based on historical market data and real-time market conditions.
Summary
Machine Learning is a subset of AI where we enable computers to learn from examples and experiences. We don’t explicitly program but let the machine learn from data and figure things out on its own. Whether it’s recognizing our favourite songs, understanding our voice commands, or even helping doctors analyze medical images, Machine Learning is already part of our daily lives.
Part 3— Basics of Deep Learning
What is Deep Learning?
Can the machine learn the way we human (human brain) learn things? — This was the idea behind innovation of Deep Learning.
Deep learning is a subset of Machine Learning (ML is again a subset of AI). At its core, deep learning is based on Artificial Neural Network (ANN), which is a computational models inspired by the structure and functioning of the human brain.
Sounds a bit confusing? Let’s simplify it in layman’s terms!
First, let’s understand few important concepts.
Biological Neural Network in Human Brain
A neuron is the human brain’s most fundamental cell. A human brain has many billions of neurons, which interact and communicate with one another, forming a neural network.
These neurons take in many inputs, from what we see and hear to how we feel to everything in-between, and then send messages to other neurons, which react in turn. Working neural networks are what enable humans to think, and more importantly, learn.
Artificial Neural Network (ANN)
Artificial neural network is a computational network designed based on biological neural networks in human brain.
Human brain has neurons interconnected to each other. Similarly, artificial neural networks also have neurons that are linked to each other. These neurons are known as nodes.
Let’s try to simplify ANN!
Picture making a big, 3D structure with pipes of different shapes and sizes. Each pipe can connect to lots of other pipes and has a switch that can be opened or closed. This gives you so many ways to connect the pipes, making it seem a bit tricky, right?
Now, let’s attach this pipe thing to a water tap. The pipes, which are of different-size, let the water move at different speeds. If we close the switches, the water won’t move.
The water represents data going through the brain, and the pipes represent the brain’s parts called neurons.
Architecture of an artificial neural network
Artificial Neural Network primarily consists of three layers — Input Layer, Output Layer and Hidden Layers.
Imagine an Artificial Neural Network similar to a sandwich with three layers.
The first layer, called the Input Layer, represent the bottom slice of bread. It takes in information.
The second layer, called the Hidden Layers, represent the yummy filling in the middle. It thinks and figures things out.
The third layer, called the Output Layer, represent the top slice of bread. It gives us the final result.
In a nutshell:
Input Layer
- This is where information goes into the artificial neural network.
- It’s the starting point, where the network receives the data it needs to work on.
Output Layer
- This is where the network gives the final result or answer.
- It’s the endpoint, where the network tells us what it has learned or decided.
Hidden Layers
- These layers are in between the input and output layers.
- Neurons in these layers process information and help the network learn patterns and make decisions.
How does Artificial Neural Network Work?
Imagine a group of kids trying to recognize a panda by sharing their observations.
- Each kid focuses on specific features such as black-and-white fur, round face, and distinct eyes.
- Individually, they might not fully understand what a panda looks like,
- But by combining their insights, they create a collective understanding.
In the world of artificial neural networks, these kids represent neurons.
- In artificial neural network, individual “neurons” (similar to kids in our example) specialize in recognizing specific aspects.
- When combined, they contribute to recognizing the overall concept (panda).
- The network refines its understanding through repeated exposure, similar to kids refining their panda recognition skills over time.
Input Layer (Observation):
Each kid observes one aspect, such as fur colour or face shape, forming the input layer of our network.
Hidden Layers (Processing):
The kids pass their observations to each other, mimicking the hidden layers of a neural network. As they share information, they collectively build a more comprehensive understanding of the panda’s features.
Output Layer (Recognition):
Finally, they reach a conclusion by combining all the details. If the majority agrees that the observed characteristics match those of a panda, they output “panda.” This output layer corresponds to the network’s final decision.
Scoring Approach:
To refine their recognition skills, the kids keep track of their accuracy.
- If they correctly identify a panda, they gain points;
- otherwise, they learn from their mistakes.
- Similarly, in neural networks, a scoring approach helps adjust the network’s parameters to enhance accuracy over time.
This teamwork illustrates how artificial neural networks process information layer by layer, learning from various features and refining their understanding through a scoring mechanism.
Deep Neural Networks
A deep neural network (DNN) is an artificial neural network (ANN) with multiple layers between the input and output layers.
Here “Deep” means it has multiple layers between the input and output, making it capable of learning complex patterns.
Important Points about Deep Learning
Now, let’s summarize some important points on Deep Learning!
Subset of ML
Deep learning is the subset of machine learning, which is in turn subset of AI.
Inspired by the Brain
Deep learning is based on artificial neural networks which is inspired by how our brains work.
Artificial Neural Networks (ANN)
ANN is a computational network which mimics biological neural networks in human brain.
Deep Neural Networks
The adjective “deep” refers to the use of multiple layers in the network. It uses deep neural networks with more than one hidden layer.
These layers process information, allowing the system to learn complex patterns.
Learning from Data
The system learns by being shown lots of examples and adjusting connections between neurons based on the differences between predictions and correct answers.
Handling Complex Problems
Deep learning is particularly effective for solving complex problems where traditional approaches may struggle.
Machine Learning Vs Deep Learning
Let’s break down the major differences between machine learning and deep learning:
Summary
In this blog, we’ve understood what deep learning is and how it works. Deep learning, aptly named for its multi-layered neural networks, which is similar to a human brain neural network with multiple layers of thinking, each level contributing to a deeper understanding of the information it processes.
From recognizing images and understanding speech to powering voice assistants and autonomous vehicles, deep learning has been beneficial to solve many complex tasks.
Part 4 — Introduction to Generative AI
What is Generative AI?
Generative AI is:
- A type of artificial intelligence
- that can create new things, for example artwork, music, or even realistic images.
- without being explicitly told what to create
While traditional AI focuses on specific tasks or solving a problem, Generative AI is distinguished by its ability to exhibit creativity similar to human creativity. Generative AI is capable of generating new, unique content, ideas, or solutions as we human do.
Let’s understand it better with an example!
Imagine I asked you to draw an animal you have never seen before. You need to use your imagination and draw a brand-new animal the world has never seen.
Since we human have imaginative power and creativity, you will be able to do that. Maybe you will draw an animal that has the body of a lion, face of a cow and the wings of a butterfly.
Now, what if a computer program could create new things all by itself! It can create new things, for example artwork, music, or even realistic images, without being explicitly told what to create.
The computer program has been given lots of pictures of lions, cow and butterflies. Now, with this knowledge, it can draw a completely new animal, say a “lion-cow-butterfly” combination. It doesn’t copy any existing image; instead, it uses its understanding of what makes lion, cow, and butterfly unique to create something entirely new something as below.
This is Generative AI — A machine (or computer) which has imagination and creativity to draw pictures, tell stories, or even make up new games without anyone showing it how.
Where Does Generative AI Fits into AI Hierarchy?
Generative AI is a subset of Deep Learning. Below diagram shows the relation between AI, Machine Learning, Deep Learning and Generative AI.
Generative AI leverages machine learning techniques, particularly deep learning and neural networks.
The main differentiator of Generative AI is the ability to generate new content.
AI, machine learning and even deep learning is mostly limited to predictive models. These are mainly used to observe and classify patterns in content or predict a new pattern or content. For example, a classic machine learning use-case is to identify image of a cat out of several given images or classify animals in different clusters based on various properties.
Generative AI is a breakthrough, because it has the ability to do something only humans were supposed to do — create an image of a cat or create an image of a totally new animal from it’s creativity.
The following image shows the evolution of AI with time. The evolution of AI from traditional rule-based systems to Generative AI has been driven by advancements in learning algorithms, computational power, and access to vast amounts of data.
Generative Models
Generative AI uses different types of machine learning models, called Generative Models.
The generative models:
- learns the underlying set of data and generates new data the closely mimics the original data
- are mainly used to create new content, such as images, text, or even music which looks exactly the same as what might be created by humans
- Usages unsupervised learning approach
Most common generative models are:
- Variational Autoencoders (VAEs),
- Generative Adversarial Networks (GANs)
- Limited Boltzmann Machines (RBMs)
- Transformer-based Language Models
In the next chapter, we will learn more about generative models.
Usages of Generative AI in Real-life
Here are some examples of how generative AI is being used to create real-life applications:
Text Generation
Most of us have used ChatGPT which is based on Generative AI. Similar to ChatGPT Generative AI based tools can be used to generate new content such as articles, reports, poetry, stories or any other text-based content.
One of the most common uses of generative AI is to build Virtual Assistants and Chatbots. Generative models are used to build advance chatbots which can interact with users mimicking human interaction.
Image Generation
Generative AI tools are used to generate new pictures even creative ones using various generative models. These models can learn from large sets of images and generate new unique images based on trained data. These models can even generate images with creativity based on input prompts similar to content generated by humans. There are various ways this can be used in real-life applications such as image-to-image translation, text-to-image translation, photograph editing, face generation, image quality enhancement etc.
One of the most common tools which usages generative AI to create realistic images and art is DALL-E, developed by OpenAI. It is a text-to-image model, which usages deep learning to generate digital images from natural language descriptions.
Video Generation
Generative models can be used to create whole videos from scratch. It stitches together scenes, characters, and actions to make a story. These videos can be used for entertainment, advertisements, or even training simulations. Video game development is one field which is heavily using generative AI.
Some generative models can be used to create new videos by learning from existing videos. This can be used for video prediction if an existing video such as security clip is damaged.
Voice Generation
Generative AI can also mimic voices or generate a whole new voice! It can learn how people talk by analysing audio data, and then generate voice in same style or create entirely new voices.
This is useful for making virtual assistants or audiobooks sound more natural.
Healthcare Applications
Generative AI models can be used to generate synthetic data samples that resemble real data. This can be very useful in medical field, where sometimes collecting real-world data is expensive or limited. For example, generative AI can be used to generating synthetic patient data for research purposes.
Drug Discovery
Generative AI is being used in drug discovery to generate new molecular structures with desired properties. This helps accelerate the process of drug development by exploring vast chemical spaces and identifying promising drug candidates.
Gaming
Generative AI has truly changed the world of gaming. It is increasingly being used in the gaming industry to accelerate game production and create unique experiences.
It helps game developers make games more exciting and immersive by creating entire worlds, characters, and stories.
Generative AI can also be used to make virtual worlds more realistic. It can be used to create unique creatures and characters, finetune each character’s personality and traits, making the game feel alive and full of surprises.
Art Generation
This is one major usage that distinguish generative AI from regular AI. Generative AI has the capability of creative thinking like we human do. Various generative models are used in generative artistic artifacts such as paintings, poetries, stories, and other multimedia-based arts.
Software Development
Generative AI has totally changed the way we write code and build software. With Generative AI tools such as GitHub Copilot, ChatGPT, AlphaCode, we can write code much faster with fine details.
Generative AI tools can assist developers by generating code snippets, enhancing software testing efficiency by identifying more defects, and suggesting optimal solutions to coding challenges. This results in faster development cycles and higher code quality, ultimately leading to improved software products and enhanced user experiences.
Finance
Financial institutions are using generative AI to analyse market trends, forecast stock movements with a high accuracy rate, and refine trading strategies. The technology also helps us having better risk assessment, fraud detection, and portfolio optimization, leading to increased efficiency, reduced costs, more profitability and better investment choices.
Example of Some Popular Generative AI Tools
In previous section, we talked about various use-cases of generative AI. Now, let’s have a look into some of popular generative AI tools available currently.
ChatGPT
ChatGPT is a conversational AI developed by OpenAI. It is designed to engage in natural language conversations with users, providing responses that are contextually relevant and coherent.
ChatGPT works by processing input text and generating responses based on the patterns and relationships it has learned from vast amounts of training data. It usages deep learning techniques, specifically transformers, which allow it to understand and generate human-like text.
GPT (Generative Pre-trained Transformer)
GPT is a transformer-based large language model, developed by OpenAI. This is the engine behind ChatGPT.
The free version of ChatGPT is based on GPT 3.5, while the more advanced GPT-4 based version, is provided to paid subscribers under the commercial name “ChatGPT Plus”.
AlphaCode
AlphaCode is a transformer-based language mode, developed by DeepMind. It is an AI-powered coding engine that generates computer programs. AlphaCode is more complex than many existing language models, with 41.4 billion parameters.
The tool leverages deep learning algorithms to analyse huge amounts of code and learn from patterns, enabling it to generate optimized code solutions. It supports a wide range of programming languages, including C#, Ruby, Python, Java, C++, and more.
GitHub Copilot
GitHub Copilot is an AI-powered code completion tool developed by GitHub in collaboration with OpenAI. It integrates directly into code editors like Visual Studio Code and provides real-time suggestions and completions for code as developers write.
It’s designed to assist developers by generating code snippets, suggesting entire lines or blocks of code, and providing contextual documentation. GitHub Copilot supports multiple programming languages such as Python, JavaScript, Java, C++, and more.
Bard
Bard is a conversational Generative AI chatbot developed by Google, as a direct response to the swift rise of OpenAI’s ChatGPT. Bard was initially based on LaMDA, a transformer-based model. Later it got upgraded to other models such as PaLM and Gemini.
Microsoft Copilot
Microsoft Copilot was initially launched by Microsoft in 2023 as an AI-powered assistant that can help to browse the web. Later it got rebranded to Microsoft Copilot.
Microsoft Copilot can be used to request summaries of articles, books, news etc., general text and images, reformat text, update images etc.
DALL-E
Developed by OpenAI, DALL-E (other versions are DALL-E2 and DALL-E3) is one of the best generative AI tools to generate images. It uses deep learning algorithms to generate images from texts.
StyleGAN
StyleGAN, developed by NVIDIA, is a generative model of type GAN (Generative Adversarial Network), which is used to generate high-quality synthetic images.
StyleGAN is extremely good in creation of realistic images of human faces and other visual content. It can generate images of human faces with a high degree of control over specific visual features such as facial attributes, pose, and background.
Below are some images generated by StyleGAN that looks like a real person. There is an interesting site https://this-person-does-not-exist.com which demonstrates how StyleGAN can be used to generate human faces which actually don’t exists.
Summary
In this blog, we got a clear idea on what generative AI is and how it is different from other AI types. We also touched upon various real-life use-cases of generative AI and some popular generative AI tools available.
Comments