Open Source Models with Hugging Face
Instructors: Maria Khalusova, Marc Sun, Younes Belkada
- Beginner
- 2 Hours 1 Minute
- 16 Video Lessons
- 13 Code Examples
- Instructors: Maria Khalusova, Marc Sun, Younes Belkada
What you'll learn
Find and filter open source models on Hugging Face Hub based on task, rankings, and memory requirements.
Write just a few lines of code using the transformers library to perform text, audio, image, and multimodal tasks.
Easily share your AI apps with a user-friendly interface or via API and run them on the cloud using Gradio and Hugging Face Spaces.
About this course
The availability of models and their weights for anyone to download enables a broader range of developers to innovate and create.
In this course, you’ll select open source models from Hugging Face Hub to perform NLP, audio, image and multimodal tasks using the Hugging Face transformers library. Easily package your code into a user-friendly app that you can run on the cloud using Gradio and Hugging Face Spaces.
You will:
- Use the transformers library to turn a small language model into a chatbot capable of multi-turn conversations to answer follow-up questions.
- Translate between languages, summarize documents, and measure the similarity between two pieces of text, which can be used for search and retrieval.
- Convert audio to text with Automatic Speech Recognition (ASR), and convert text to audio using Text to Speech (TTS).
- Perform zero-shot audio classification, to classify audio without fine-tuning the model.
- Generate an audio narration describing an image by combining object detection and text-to-speech models.
- Identify objects or regions in an image by prompting a zero-shot image segmentation model with points to identify the object that you want to select.
- Implement visual question answering, image search, image captioning and other multimodal tasks.
- Share your AI app using Gradio and Hugging Face Spaces to run your applications in a user-friendly interface on the cloud or as an API.
The course will provide you with the building blocks that you can combine into a pipeline to build your AI-enabled applications!
Who should join?
Anyone who wants to get started building AI applications quickly and easily using open source models.
Course Outline
16 Lessons・13 Code ExamplesIntroduction
Video・5 mins
Selecting models
Video・5 mins
Natural Language Processing (NLP)
Video with code examples・10 mins
Translation and Summarization
Video with code examples・5 mins
Sentence Embeddings
Video with code examples・5 mins
Zero-Shot Audio Classification
Video with code examples・10 mins
Automatic Speech Recognition
Video with code examples・15 mins
Text to Speech
Video with code examples・2 mins
Object Detection
Video with code examples・11 mins
Image Segmentation
Video with code examples・16 mins
Image Retrieval
Video with code examples・7 mins
Image Captioning
Video with code examples・5 mins
Multimodal Visual Question Answering
Video with code examples・4 mins
Zero-Shot Image Classification
Video with code examples・4 mins
Deployment
Video with code examples・11 mins
Conclusion
Video・1 mins
Instructors
Open Source Models with Hugging Face
- Beginner
- 2 Hours 1 Minute
- 16 Video Lessons
- 13 Code Examples
- Instructors: Maria Khalusova, Marc Sun, Younes Belkada
Course access is free for a limited time during the DeepLearning.AI learning platform beta!
Want to learn more about Generative AI?
Keep learning with updates on curated AI news, courses, and events, as well as Andrew’s thoughts from DeepLearning.AI!