  {"id":18352,"date":"2026-03-20T13:07:59","date_gmt":"2026-03-20T17:07:59","guid":{"rendered":"https:\/\/www.rochester.edu\/coe\/?p=18352"},"modified":"2026-03-20T13:07:59","modified_gmt":"2026-03-20T17:07:59","slug":"leverage-large-language-models-for-complex-robot-manipulation","status":"publish","type":"post","link":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/","title":{"rendered":"Leverage Large Language Models for Complex Robot Manipulation"},"content":{"rendered":"<p>PI Researcher: Chenliang Xu, Ä¢¹½´«Ã½<br \/>\nCompany Partner: Corning<\/p>\n<p>In robot manipulation, adapting to dynamic environments with flexible task specifications is challenging. Language-based vision manipulation systems offer a solution by linking language instructions to visual data and generating actions. However, current approaches often develop vision models and action policies separately, leading to poor integration. To address this, we propose ACTLLM, a method that unifies visual interpretation and policy learning using large language models (LLMs). By generating structured scene descriptions and incorporating an action consistency loss, ACTLLM is expected to enhance the fusion of visual and policy elements, facilitating the efficient execution of complex tasks within a multi-turn visual dialogue framework.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>PI Researcher: Chenliang Xu, Ä¢¹½´«Ã½ Company Partner: Corning In robot manipulation, adapting to dynamic environments with flexible task specifications is challenging. Language-based vision manipulation systems offer a solution&hellip;<\/p>\n","protected":false},"author":122,"featured_media":9232,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[222],"tags":[252,242],"class_list":["post-18352","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-research-partnerships","tag-252","tag-past-project"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Leverage Large Language Models for Complex Robot Manipulation - Center of Excellence in Data Science and Artificial Intelligence<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Leverage Large Language Models for Complex Robot Manipulation - Center of Excellence in Data Science and Artificial Intelligence\" \/>\n<meta property=\"og:description\" content=\"PI Researcher: Chenliang Xu, Ä¢¹½´«Ã½ Company Partner: Corning In robot manipulation, adapting to dynamic environments with flexible task specifications is challenging. Language-based vision manipulation systems offer a solution&hellip;\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/\" \/>\n<meta property=\"og:site_name\" content=\"Center of Excellence in Data Science and Artificial Intelligence\" \/>\n<meta property=\"article:published_time\" content=\"2026-03-20T17:07:59+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.rochester.edu\/coe\/wp-content\/uploads\/2023\/10\/ChenliangXu.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"300\" \/>\n\t<meta property=\"og:image:height\" content=\"375\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Nick Koziol\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Nick Koziol\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/leverage-large-language-models-for-complex-robot-manipulation\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/leverage-large-language-models-for-complex-robot-manipulation\\\/\"},\"author\":{\"name\":\"Nick Koziol\",\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/#\\\/schema\\\/person\\\/522f11e09ef4e261f96eeeb4ca753816\"},\"headline\":\"Leverage Large Language Models for Complex Robot Manipulation\",\"datePublished\":\"2026-03-20T17:07:59+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/leverage-large-language-models-for-complex-robot-manipulation\\\/\"},\"wordCount\":119,\"image\":{\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/leverage-large-language-models-for-complex-robot-manipulation\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/ChenliangXu.jpg\",\"keywords\":[\"2025\",\"Past Project\"],\"articleSection\":[\"Research Partnerships\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/leverage-large-language-models-for-complex-robot-manipulation\\\/\",\"url\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/leverage-large-language-models-for-complex-robot-manipulation\\\/\",\"name\":\"Leverage Large Language Models for Complex Robot Manipulation - Center of Excellence in Data Science and Artificial Intelligence\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/leverage-large-language-models-for-complex-robot-manipulation\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/leverage-large-language-models-for-complex-robot-manipulation\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/ChenliangXu.jpg\",\"datePublished\":\"2026-03-20T17:07:59+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/#\\\/schema\\\/person\\\/522f11e09ef4e261f96eeeb4ca753816\"},\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/leverage-large-language-models-for-complex-robot-manipulation\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/leverage-large-language-models-for-complex-robot-manipulation\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/leverage-large-language-models-for-complex-robot-manipulation\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/ChenliangXu.jpg\",\"contentUrl\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/wp-content\\\/uploads\\\/2023\\\/10\\\/ChenliangXu.jpg\",\"width\":300,\"height\":375},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/leverage-large-language-models-for-complex-robot-manipulation\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Leverage Large Language Models for Complex Robot Manipulation\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/#website\",\"url\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/\",\"name\":\"Center of Excellence in Data Science and Artificial Intelligence\",\"description\":\"Ä¢¹½´«Ã½\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.rochester.edu\\\/coe\\\/#\\\/schema\\\/person\\\/522f11e09ef4e261f96eeeb4ca753816\",\"name\":\"Nick Koziol\",\"sameAs\":[\"https:\\\/\\\/www.rochester.edu\\\/coe\"]}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Leverage Large Language Models for Complex Robot Manipulation - Center of Excellence in Data Science and Artificial Intelligence","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/","og_locale":"en_US","og_type":"article","og_title":"Leverage Large Language Models for Complex Robot Manipulation - Center of Excellence in Data Science and Artificial Intelligence","og_description":"PI Researcher: Chenliang Xu, Ä¢¹½´«Ã½ Company Partner: Corning In robot manipulation, adapting to dynamic environments with flexible task specifications is challenging. Language-based vision manipulation systems offer a solution&hellip;","og_url":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/","og_site_name":"Center of Excellence in Data Science and Artificial Intelligence","article_published_time":"2026-03-20T17:07:59+00:00","og_image":[{"url":"https:\/\/www.rochester.edu\/coe\/wp-content\/uploads\/2023\/10\/ChenliangXu.jpg","width":300,"height":375,"type":"image\/jpeg"}],"author":"Nick Koziol","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Nick Koziol","Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/#article","isPartOf":{"@id":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/"},"author":{"name":"Nick Koziol","@id":"https:\/\/www.rochester.edu\/coe\/#\/schema\/person\/522f11e09ef4e261f96eeeb4ca753816"},"headline":"Leverage Large Language Models for Complex Robot Manipulation","datePublished":"2026-03-20T17:07:59+00:00","mainEntityOfPage":{"@id":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/"},"wordCount":119,"image":{"@id":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/#primaryimage"},"thumbnailUrl":"https:\/\/www.rochester.edu\/coe\/wp-content\/uploads\/2023\/10\/ChenliangXu.jpg","keywords":["2025","Past Project"],"articleSection":["Research Partnerships"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/","url":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/","name":"Leverage Large Language Models for Complex Robot Manipulation - Center of Excellence in Data Science and Artificial Intelligence","isPartOf":{"@id":"https:\/\/www.rochester.edu\/coe\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/#primaryimage"},"image":{"@id":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/#primaryimage"},"thumbnailUrl":"https:\/\/www.rochester.edu\/coe\/wp-content\/uploads\/2023\/10\/ChenliangXu.jpg","datePublished":"2026-03-20T17:07:59+00:00","author":{"@id":"https:\/\/www.rochester.edu\/coe\/#\/schema\/person\/522f11e09ef4e261f96eeeb4ca753816"},"breadcrumb":{"@id":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/#primaryimage","url":"https:\/\/www.rochester.edu\/coe\/wp-content\/uploads\/2023\/10\/ChenliangXu.jpg","contentUrl":"https:\/\/www.rochester.edu\/coe\/wp-content\/uploads\/2023\/10\/ChenliangXu.jpg","width":300,"height":375},{"@type":"BreadcrumbList","@id":"https:\/\/www.rochester.edu\/coe\/leverage-large-language-models-for-complex-robot-manipulation\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.rochester.edu\/coe\/"},{"@type":"ListItem","position":2,"name":"Leverage Large Language Models for Complex Robot Manipulation"}]},{"@type":"WebSite","@id":"https:\/\/www.rochester.edu\/coe\/#website","url":"https:\/\/www.rochester.edu\/coe\/","name":"Center of Excellence in Data Science and Artificial Intelligence","description":"Ä¢¹½´«Ã½","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.rochester.edu\/coe\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.rochester.edu\/coe\/#\/schema\/person\/522f11e09ef4e261f96eeeb4ca753816","name":"Nick Koziol","sameAs":["https:\/\/www.rochester.edu\/coe"]}]}},"_links":{"self":[{"href":"https:\/\/www.rochester.edu\/coe\/wp-json\/wp\/v2\/posts\/18352","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.rochester.edu\/coe\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.rochester.edu\/coe\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.rochester.edu\/coe\/wp-json\/wp\/v2\/users\/122"}],"replies":[{"embeddable":true,"href":"https:\/\/www.rochester.edu\/coe\/wp-json\/wp\/v2\/comments?post=18352"}],"version-history":[{"count":1,"href":"https:\/\/www.rochester.edu\/coe\/wp-json\/wp\/v2\/posts\/18352\/revisions"}],"predecessor-version":[{"id":18362,"href":"https:\/\/www.rochester.edu\/coe\/wp-json\/wp\/v2\/posts\/18352\/revisions\/18362"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.rochester.edu\/coe\/wp-json\/wp\/v2\/media\/9232"}],"wp:attachment":[{"href":"https:\/\/www.rochester.edu\/coe\/wp-json\/wp\/v2\/media?parent=18352"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.rochester.edu\/coe\/wp-json\/wp\/v2\/categories?post=18352"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.rochester.edu\/coe\/wp-json\/wp\/v2\/tags?post=18352"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}