Short CourseBeginner2 Hours 1 Minute

Open Source Models with Hugging Face

Instructors: Maria Khalusova, Marc Sun, Younes Belkada

Hugging Face
  • Beginner
  • 2 Hours 1 Minute
  • 16 Video Lessons
  • 13 Code Examples
  • Instructors: Maria Khalusova, Marc Sun, Younes Belkada
    • Hugging Face
    Hugging Face

What you'll learn

  • Find and filter open source models on Hugging Face Hub based on task, rankings, and memory requirements.

  • Write just a few lines of code using the transformers library to perform text, audio, image, and multimodal tasks.

  • Easily share your AI apps with a user-friendly interface or via API and run them on the cloud using Gradio and Hugging Face Spaces.

About this course

The availability of models and their weights for anyone to download enables a broader range of developers to innovate and create.

In this course, you’ll select open source models from Hugging Face Hub to perform NLP, audio, image and multimodal tasks using the Hugging Face transformers library. Easily package your code into a user-friendly app that you can run on the cloud using Gradio and Hugging Face Spaces.

You will:

  • Use the transformers library to turn a small language model into a chatbot capable of multi-turn conversations to answer follow-up questions.
  • Translate between languages, summarize documents, and measure the similarity between two pieces of text, which can be used for search and retrieval.
  • Convert audio to text with Automatic Speech Recognition (ASR), and convert text to audio using Text to Speech (TTS).
  • Perform zero-shot audio classification, to classify audio without fine-tuning the model.
  • Generate an audio narration describing an image by combining object detection and text-to-speech models.  
  • Identify objects or regions in an image by prompting a zero-shot image segmentation model with points to identify the object that you want to select.
  • Implement visual question answering, image search, image captioning and other multimodal tasks.
  • Share your AI app using Gradio and Hugging Face Spaces to run your applications in a user-friendly interface on the cloud or as an API. 

The course will provide you with the building blocks that you can combine into a pipeline to build your AI-enabled applications!

Who should join?

Anyone who wants to get started building AI applications quickly and easily using open source models.

Course Outline

16 Lessons・13 Code Examples
  • Introduction

    Video5 mins

  • Selecting models

    Video5 mins

  • Natural Language Processing (NLP)

    Video with code examples10 mins

  • Translation and Summarization

    Video with code examples5 mins

  • Sentence Embeddings

    Video with code examples5 mins

  • Zero-Shot Audio Classification

    Video with code examples10 mins

  • Automatic Speech Recognition

    Video with code examples15 mins

  • Text to Speech

    Video with code examples2 mins

  • Object Detection

    Video with code examples11 mins

  • Image Segmentation

    Video with code examples16 mins

  • Image Retrieval

    Video with code examples7 mins

  • Image Captioning

    Video with code examples5 mins

  • Multimodal Visual Question Answering

    Video with code examples4 mins

  • Zero-Shot Image Classification

    Video with code examples4 mins

  • Deployment

    Video with code examples11 mins

  • Conclusion

    Video1 mins

Instructors

Maria Khalusova

Maria Khalusova

Member of Technical Staff at Hugging Face

Marc Sun

Marc Sun

Machine Learning Engineer at Hugging Face

Younes Belkada

Younes Belkada

Machine Learning Engineer at Hugging Face

Course access is free for a limited time during the DeepLearning.AI learning platform beta!

Want to learn more about Generative AI?

Keep learning with updates on curated AI news, courses, and events, as well as Andrew’s thoughts from DeepLearning.AI!