{"id":918900,"date":"2023-02-20T20:01:28","date_gmt":"2023-02-21T04:01:28","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-blog-post&#038;p=918900"},"modified":"2023-03-03T10:55:26","modified_gmt":"2023-03-03T18:55:26","slug":"chatgpt-for-robotics","status":"publish","type":"msr-blog-post","link":"https:\/\/www.microsoft.com\/en-us\/research\/articles\/chatgpt-for-robotics\/","title":{"rendered":"ChatGPT for Robotics: Design Principles and Model Abilities"},"content":{"rendered":"\n<p class=\"has-gray-color has-text-color\" style=\"font-size:1.2em\">We extended the capabilities of ChatGPT to robotics, and controlled multiple platforms such as robot arms, drones, and home assistant robots intuitively with language.<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"600\" height=\"338\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/chatgpt_robotics_gif.gif\" alt=\"main gif of multiple robots\" class=\"wp-image-921120\" \/><\/figure>\n\n\n\n<div class=\"wp-block-group is-layout-constrained wp-block-group-is-layout-constrained\">\n<div class=\"wp-block-columns is-layout-flex wp-container-core-columns-is-layout-9d6595d7 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\" style=\"flex-basis:100%\">\n<div class=\"wp-block-group is-layout-constrained wp-block-group-is-layout-constrained\">\n<div class=\"wp-block-group is-layout-constrained wp-block-group-is-layout-constrained\">\n<div class=\"wp-block-buttons is-content-justification-center is-content-justification-center is-layout-flex wp-container-core-buttons-is-layout-5e6b19ed wp-block-buttons-is-layout-flex\">\n<div class=\"wp-block-button aligncenter has-custom-width wp-block-button__width-50 is-style-fill-download is-style-fill\"><a data-bi-type=\"button\" class=\"wp-block-button__link wp-element-button\" 
href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/ChatGPT___Robotics.pdf\" target=\"_blank\" rel=\"noreferrer noopener\">Paper<\/a><\/div>\n\n\n\n<div class=\"wp-block-button aligncenter is-style-outline is-style-outline--1\"><a data-bi-type=\"button\" class=\"wp-block-button__link wp-element-button\" href=\"https:\/\/youtu.be\/NYd0QcZcS6Q\" target=\"_blank\" rel=\"noreferrer noopener\">Video<\/a><\/div>\n\n\n\n<div class=\"wp-block-button is-style-fill-github\"><a data-bi-type=\"button\" class=\"wp-block-button__link wp-element-button\" href=\"https:\/\/github.com\/microsoft\/PromptCraft-Robotics\" target=\"_blank\" rel=\"noreferrer noopener\">PromptCraft code<\/a><\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n<\/div>\n\n\n\n<p>Have you ever wanted to tell a robot what to do using your own words, like you would to a human? Wouldn&#8217;t it be amazing to just tell your home assistant robot: &#8220;<em>Please warm up my lunch<\/em>&#8220;, and have it find the microwave by itself? Even though language is the most intuitive way for us to express our intentions, we still rely heavily on hand-written code to control robots. 
Our team has been exploring how we can change this reality and make natural human-robot interactions possible using <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/openai.com\/\" target=\"_blank\" rel=\"noopener noreferrer\">OpenAI<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>&#8216;s new AI language model, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/openai.com\/blog\/chatgpt\/\" target=\"_blank\" rel=\"noopener noreferrer\">ChatGPT<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<\/p>\n\n\n\n<p>ChatGPT is a language model trained on a massive corpus of text and human interactions, allowing it to generate coherent and grammatically correct responses to a wide range of prompts and questions. Our goal with this research is to see if ChatGPT can think beyond text, and reason about the physical world to help with robotics tasks. We want to help people interact with robots more easily, without needing to learn complex programming languages or details about robotic systems. The key challenge here is teaching ChatGPT how to solve problems considering the laws of physics, the context of the operating environment, and how the robot&#8217;s physical actions can change the state of the world. <\/p>\n\n\n\n<p>It turns out that ChatGPT can do a lot by itself, but it still needs some help. Our <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/ChatGPT___Robotics.pdf\" target=\"_blank\" rel=\"noreferrer noopener\">technical paper<\/a> describes a series of design principles that can be used to guide language models towards solving robotics tasks. These include, and are not limited to, special prompting structures, high-level APIs, and human feedback via text. 
We believe that our work is just the start of a shift in how we develop robotics systems, and we hope to inspire other researchers to jump into this exciting field. Continue reading for more technical details on our methods and ideas.<\/p>\n\n\n\n<figure class=\"wp-block-embed is-provider-youtube wp-block-embed-youtube\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"ChatGPT for Robotics\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube-nocookie.com\/embed\/NYd0QcZcS6Q?feature=oembed&rel=0\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n\n\n<h3 id=\"challenges-in-robotics-today-and-how-chatgpt-can-help\">Challenges in robotics today, and how ChatGPT can help<\/h3>\n\n\n\n<p>Current robotics pipelines begin with an engineer or technical user who needs to translate the task&#8217;s requirements into code for the system. The engineer sits <em>in the loop<\/em>, meaning that they need to write new code and specifications to correct the robot&#8217;s behavior. 
Overall, this process is slow (the user needs to write low-level code), expensive (requires highly skilled users with deep knowledge of robotics), and inefficient (requires multiple interactions to get things working properly).<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"2362\" height=\"714\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/main.jpg\" alt=\"robotics today versus with chatgpt\" class=\"wp-image-920934\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/main.jpg 2362w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/main-300x91.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/main-1024x310.jpg 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/main-768x232.jpg 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/main-1536x464.jpg 1536w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/main-2048x619.jpg 2048w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/main-240x73.jpg 240w\" sizes=\"auto, (max-width: 2362px) 100vw, 2362px\" \/><\/figure>\n\n\n\n<p>ChatGPT unlocks a new robotics paradigm, and allows a (potentially non-technical) user to sit <em>on the loop<\/em>, providing high-level feedback to the large language model (LLM) while monitoring the robot&#8217;s performance. By following our set of design principles, ChatGPT can generate code for robotics scenarios. Without any fine-tuning, we leverage the LLM&#8217;s knowledge to control different robot form factors for a variety of tasks. 
In our work, we show multiple examples of ChatGPT solving robotics puzzles, along with complex robot deployments in the manipulation, aerial, and navigation domains.<\/p>\n\n\n\n<h3 id=\"robotics-with-chatgpt-design-principles\">Robotics with ChatGPT: design principles<\/h3>\n\n\n\n<p>Prompting LLMs is a highly empirical science. Through trial and error, we built a methodology and a set of design principles for writing prompts for robotics tasks:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"2021\" height=\"450\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/steps.png\" alt=\"new pipeline with chatgpt\" class=\"wp-image-920955\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/steps.png 2021w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/steps-300x67.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/steps-1024x228.png 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/steps-768x171.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/steps-1536x342.png 1536w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/steps-240x53.png 240w\" sizes=\"auto, (max-width: 2021px) 100vw, 2021px\" \/><\/figure>\n\n\n\n<ol class=\"wp-block-list\">\n<li>First, we define a set of high-level robot APIs or a function library. This library can be specific to a particular robot, and should map to existing low-level implementations from the robot&#8217;s control stack or a perception library. It&#8217;s very important to use descriptive names for the high-level APIs so ChatGPT can reason about their behaviors;<\/li>\n\n\n\n<li>Next, we write a text prompt for ChatGPT which describes the task goal while also explicitly stating which functions from the high-level library are available. 
The prompt can also contain information about task constraints,<br>or how ChatGPT should form its answers (specific coding language, using auxiliary parsing elements);<\/li>\n\n\n\n<li>The user stays on the loop to evaluate ChatGPT&#8217;s code output, either through direct inspection or using a simulator. If needed, the user uses natural language to provide feedback to ChatGPT on the answer&#8217;s quality and safety.<\/li>\n\n\n\n<li>When the user is happy with the solution, the final code can be deployed onto the robot.<\/li>\n<\/ol>\n\n\n\n<h3 id=\"enough-theory-what-exactly-can-chatgpt-do\">Enough theory&#8230; What exactly can ChatGPT do?<\/h3>\n\n\n\n<p>Let&#8217;s take a look at a few examples&#8230; You can find even more case studies <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/github.com\/microsoft\/PromptCraft-Robotics\" target=\"_blank\" rel=\"noopener noreferrer\">in our code repository<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. <\/p>\n\n\n\n<h4 id=\"zero-shot-task-planning\">Zero-shot task planning<\/h4>\n\n\n\n<p>We gave ChatGPT access to functions that control a real drone, and it proved to be an extremely intuitive language-based interface between the non-technical user and the robot. ChatGPT asked clarifying questions when the user&#8217;s instructions were ambiguous, and wrote complex code structures for the drone such as a zig-zag pattern to visually inspect shelves. It even figured out how to take a selfie! 
\ud83d\udcf7 \ud83d\ude0e<\/p>\n\n\n\n<figure class=\"wp-block-embed is-provider-youtube wp-block-embed-youtube\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"ChatGPT + Real Drone\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube-nocookie.com\/embed\/i5wZJFb4dyA?feature=oembed&rel=0\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n\n\n<p>We also used ChatGPT in a simulated industrial inspection scenario with the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/github.com\/microsoft\/AirSim\" target=\"_blank\" rel=\"noopener noreferrer\">Microsoft AirSim simulator<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. The model was able to effectively parse the user&#8217;s high-level intent and geometrical cues to control the drone accurately.<\/p>\n\n\n\n<figure class=\"wp-block-embed is-provider-youtube wp-block-embed-youtube\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"ChatGPT + AirSim\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube-nocookie.com\/embed\/38lA3U2J43w?feature=oembed&rel=0\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n\n\n<h4 id=\"user-on-the-loop-when-a-conversation-is-needed-for-a-complex-tasks\">User on the loop: when a conversation is needed for complex tasks<\/h4>\n\n\n\n<p>Next, we used ChatGPT in a manipulation scenario with a robot arm. We used conversational feedback to teach the model how to compose the originally provided APIs into more complex high-level functions that ChatGPT coded by itself. 
Using a curriculum-based strategy, the model was able to chain these learned skills together logically to perform operations such as stacking blocks. <\/p>\n\n\n\n<p>In addition, the model displayed a fascinating example of bridging the textual and physical domains when tasked with building the Microsoft logo out of wooden blocks. Not only was it able to recall the logo from its internal knowledge base, it was able to &#8216;draw&#8217; the logo (as SVG code), and then use the skills learned above to figure out which existing robot actions can compose its physical form.<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full is-resized\"><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/Screenshot-2023-02-20-105158.png\" alt=\"Excerpt from ChatGPT conversation where it recalls the Microsoft logo from its knowledge base and draws it using SVG code.\" class=\"wp-image-921279\" width=\"558\" height=\"423\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/Screenshot-2023-02-20-105158.png 1115w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/Screenshot-2023-02-20-105158-300x227.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/Screenshot-2023-02-20-105158-1024x776.png 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/Screenshot-2023-02-20-105158-768x582.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/Screenshot-2023-02-20-105158-80x60.png 80w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/Screenshot-2023-02-20-105158-238x180.png 238w\" sizes=\"auto, (max-width: 558px) 100vw, 558px\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-embed is-provider-youtube wp-block-embed-youtube\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"ChatGPT + Manipulation\" 
width=\"500\" height=\"281\" src=\"https:\/\/www.youtube-nocookie.com\/embed\/wLOChUtdqoA?feature=oembed&rel=0\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n\n\n<p>Next, we tasked ChatGPT to write an algorithm for a drone to reach a goal in space while not crashing into obstacles. We told the model that this drone has a forward facing distance sensor, and ChatGPT coded most of the key building blocks for the algorithm right away. This task required some conversation with the human, and we were impressed by ChatGPT&#8217;s ability to make localized code improvements using only language feedback.<\/p>\n\n\n\n<figure class=\"wp-block-embed is-provider-youtube wp-block-embed-youtube\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"ChatGPT - Aerial Obstacle Avoidance\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube-nocookie.com\/embed\/Vn6NapLlHPE?feature=oembed&rel=0\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n\n\n<h4 id=\"perception-action-loops-robots-that-sense-the-world-before-they-act\">Perception-action loops: robots that sense the world before they act<\/h4>\n\n\n\n<p>The ability to sense the world (perception) before doing something (action) is fundamental to any robotics system. Therefore, we decided to test ChatGPT&#8217;s understanding of this concept and asked it to explore an environment until finding a user-specified object. We gave the model access to functions such as object detection and object distance APIs, and verified that the code it generated successfully implemented a perception-action loop.  
<\/p>\n\n\n\n<p>As an exploratory step, we ran additional experiments to evaluate whether ChatGPT can decide where the robot should go based on sensor feedback in real time (as opposed to having ChatGPT generate a code loop that makes these decisions). Interestingly, we verified that we could feed a textual description of the camera image at each step into the chat, and the model was able to figure out how to control the robot until it reached a particular object.  <\/p>\n\n\n\n<figure class=\"wp-block-embed is-provider-youtube wp-block-embed-youtube\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"ChatGPT + Embodied AI\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube-nocookie.com\/embed\/p0fDH9zZm_c?feature=oembed&rel=0\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n\n\n<h3 id=\"promptcraft-a-collaborative-open-sourced-tool-for-llmrobotics-research\">PromptCraft, a collaborative open-sourced tool for LLM+Robotics research<\/h3>\n\n\n\n<p>Good prompt engineering is crucial for the success of LLMs such as ChatGPT for robotics tasks. Unfortunately, prompting is an empirical science, and there is a lack of comprehensive and accessible resources with good (and bad) examples to help researchers and enthusiasts in the field. To address this gap, we introduce <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/github.com\/microsoft\/PromptCraft-Robotics\" target=\"_blank\" rel=\"noopener noreferrer\">PromptCraft<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, a collaborative open-source platform where anyone can share examples of prompting strategies for different robotics categories. We release all of the prompts and conversations used in this study. 
We invite readers to contribute more! <\/p>\n\n\n\n<p>Besides prompt design, we hope to also include multiple robotics simulators and interfaces to allow users to test their ChatGPT-generated algorithms. As a start, we also release an AirSim environment with ChatGPT integration that anyone can use to get started with these ideas. We welcome contributions of new simulators and interfaces as well.<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1695\" height=\"1005\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/AirSimScreenshot.jpg\" alt=\"Screenshot of the ChatGPT - AirSim interface\" class=\"wp-image-921300\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/AirSimScreenshot.jpg 1695w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/AirSimScreenshot-300x178.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/AirSimScreenshot-1024x607.jpg 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/AirSimScreenshot-768x455.jpg 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/AirSimScreenshot-1536x911.jpg 1536w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/AirSimScreenshot-240x142.jpg 240w\" sizes=\"auto, (max-width: 1695px) 100vw, 1695px\" \/><figcaption class=\"wp-element-caption\">The ChatGPT-AirSim interface<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-embed is-provider-youtube wp-block-embed-youtube\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"ChatGPT AirSim Interface\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube-nocookie.com\/embed\/iE5tZ6_ZYE8?feature=oembed&rel=0\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" 
referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n\n\n<h3 id=\"bringing-robotics-out-of-labs-and-into-the-world\">Bringing robotics out of labs, and into the world<\/h3>\n\n\n\n<p>We are excited to release these technologies with the aim of bringing robotics within reach of a wider audience. We believe that language-based robotics control will be fundamental to bringing robotics out of science labs, and into the hands of everyday users. <\/p>\n\n\n\n<p>That said, we do emphasize that the outputs from ChatGPT are not meant to be deployed directly on robots without careful analysis. We encourage users to harness the power of simulations in order to evaluate these algorithms before potential real life deployments, and to always take the necessary safety precautions. Our work represents only a small fraction of what is possible at the intersection of large language models and robotics, and we hope to inspire much of the work to come.<\/p>\n\n\n\n<h6 id=\"citation\">Citation<\/h6>\n\n\n\n<p>If you find <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2023\/02\/ChatGPT___Robotics.pdf\" target=\"_blank\" rel=\"noreferrer noopener\">this work<\/a> useful in your research, please cite us as<\/p>\n\n\n\n<p><code>@techreport{vemprala2023chatgpt,<br>author = {Vemprala, Sai and Bonatti, Rogerio and Bucker, Arthur and Kapoor, Ashish},<br>title = {ChatGPT for Robotics: Design Principles and Model Abilities},<br>institution = {Microsoft},<br>year = {2023},<br>month = {February},<br>url = {https:\/\/www.microsoft.com\/en-us\/research\/publication\/chatgpt-for-robotics-design-principles-and-model-abilities\/},<br>number = {MSR-TR-2023-8},<br>}<\/code><\/p>\n\n\n\n<p><em>This work is being undertaken by members of the <\/em><a href=\"https:\/\/www.microsoft.com\/en-us\/ai\/autonomous-systems\" target=\"_blank\" rel=\"noreferrer noopener\"><em>Microsoft Autonomous Systems and Robotics 
Research Group<\/em><\/a>. <em>The researchers included in this project are: <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/savempra\/\" target=\"_blank\" rel=\"noreferrer noopener\"><em>Sai Vemprala<\/em><\/a><\/em>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"http:\/\/rogeriobonatti.com\/\" target=\"_blank\" rel=\"noopener noreferrer\"><em>Rogerio Bonatti<\/em><span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/scholar.google.com.br\/citations?user=8cEgwaEAAAAJ&hl=en&oi=ao\" target=\"_blank\" rel=\"noopener noreferrer\"><em>Arthur Bucker<\/em><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><em>, and <\/em><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/akapoor\/\" target=\"_blank\" rel=\"noreferrer noopener\"><em>Ashish Kapoor<\/em><\/a><em>. <\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>We extended the capabilities of ChatGPT to robotics, and controlled multiple platforms such as robot arms, drones, and home assistant robots intuitively with language. Have you ever wanted to tell a robot what to do using your own words, like you would to a human? 
Wouldn&#8217;t it be amazing to just tell your home assistant [&hellip;]<\/p>\n","protected":false},"author":41419,"featured_media":921120,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-content-parent":867219,"msr_hide_image_in_river":0,"footnotes":""},"research-area":[],"msr-locale":[268875],"msr-post-option":[],"class_list":["post-918900","msr-blog-post","type-msr-blog-post","status-publish","has-post-thumbnail","hentry","msr-locale-en_us"],"msr_assoc_parent":{"id":867219,"type":"group"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/918900","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-blog-post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/41419"}],"version-history":[{"count":93,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/918900\/revisions"}],"predecessor-version":[{"id":1007493,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/918900\/revisions\/1007493"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/921120"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=918900"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=918900"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=918900"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.micros
oft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=918900"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}