Building Towards Computer Use with Anthropic
Instructor: Colt Steele
- Beginner
- 1 Hour 35 Minutes
- 9 Video Lessons
- 6 Code Examples
- Instructor: Colt Steele
What you'll learn
Learn about Anthropic’s family of models, its approach to AI research, and the best way to prompt it, including multi-modal use cases.
Learn effective prompting techniques, implement prompt caching to reduce costs and latency, and build AI applications that can call tools.
See how you can combine multimodal capabilities, agentic workflows, and tool use to build an AI assistant that can navigate and interact with computer interfaces, executing tasks like web searches.
About this course
Building Towards Computer Use with Anthropic introduces an innovative capability from Anthropic that enables models to interact with and navigate computer interfaces.
Taught by Colt Steele, Anthropic’s Head of Curriculum, this course covers Anthropic’s family of models and the building blocks that lead to the amazing new application – Computer Use.
Computer Use utilizes the capabilities of the latest models including image reasoning and tool use to enable an LLM-based agent to use a computer. Like a human user, the model processes an image of the screen, analyzes it to understand what’s going on, and navigates the computer by issuing mouse clicks and generating keyboard strokes to get things done.
In this course, you’ll learn the features that lead up to computer use from working with the Anthropic’s API, to multimodal prompting, prompt caching, and tool use, ending in a demo that combines these features to build an AI assistant that uses a computer.
In detail, you’ll:
- Learn Anthropic’s approach to AI research, principles of AI safety, alignment, and interpretability while understanding the key differences between its models.
- Make API requests to Claude, format messages for better responses, and control API parameters like system prompts, temperature, and max tokens for optimal responses.
- Write multi-modal prompts that combine text and image content blocks and build with streaming responses.
- Learn effective prompting techniques such as using prompt templates, structuring prompts in XML, and providing examples to get consistent high-quality responses.
- Learn to implement prompt caching and see how it can reduce costs and latency.
- Understand tool-use workflows and build a chatbot that can call different tools in response to users’ queries.
- See all these concepts come together in a demo that uses Anthropic Computer Use to achieve a task on a computer.
Start utilizing Anthropic’s family of models to build towards Computer Use applications.
Who should join?
Anyone who has basic Python knowledge, wants to learn how to use all of the features of the Anthropic family of models, and understand the capabilities of computer use applications.
Course Outline
9 Lessons・6 Code ExamplesIntroduction
Video・3 mins
Overview
Video・5 mins
Working with the API
Video with code examples・15 mins
Multimodal Requests
Video with code examples・12 mins
Real World Prompting
Video with code examples・17 mins
Prompt Caching
Video with code examples・12 mins
Tool Use
Video with code examples・17 mins
Computer Use
Video・10 mins
Conclusion
Video・1 min
Appendix – Tips and Help
Code examples・1 min
Instructor
Colt Steele
Head of Curriculum at Anthropic
Building Towards Computer Use with Anthropic
- Beginner
- 1 Hour 35 Minutes
- 9 Video Lessons
- 6 Code Examples
- Instructor: Colt Steele
Course access is free for a limited time during the DeepLearning.AI learning platform beta!
Want to learn more about Generative AI?
Keep learning with updates on curated AI news, courses, and events, as well as Andrew’s thoughts from DeepLearning.AI!