Tag: LLM

  • Two LLMs Team Up to Help Robots Interpret Vague Instructions and Prioritize What Matters

    Two LLMs Team Up to Help Robots Interpret Vague Instructions and Prioritize What Matters

    Researchers at MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) have developed a new method that uses two large language models (LLMs) to help robots understand ambiguous instructions and focus on key details. The approach, called Masked Inverse Reinforcement Learning (Masked IRL), reduces the amount of demonstration data needed to teach a robot by nearly five times, while improving the robot’s ability to infer unspoken user preferences.

    Traditional robot training often requires either extensive physical demonstrations or detailed written instructions. Masked IRL automates the process: first, one LLM clarifies ambiguous prompts (e.g., turning “stay close” into “stay close to the surface of the table”) by comparing a user’s demonstration trajectory to the shortest possible path. Then a second LLM evaluates the environment and “masks” irrelevant details – such as a person leaning on a table – while highlighting critical ones like obstacles to avoid. The robot then uses these prioritized details to generate a safe motion plan.

    In experiments, the system correctly identified unstated user preferences up to 15 percent more often than comparable baselines. Real-world tests showed a robotic arm successfully moving a coffee mug around a laptop, wiping a table while “staying close” to it, and handing a user a bag of chips while “staying away” from both the person and the table – all after fewer than 50 kinesthetic demonstrations.

    The team plans to enhance Masked IRL with camera input, allowing robots to visually focus on relevant objects in dynamic environments. The work was supported by the Tata Group via the MIT Generative AI Impact Consortium Award and the Department of Defense, and will be presented at the 2026 IEEE International Conference on Robotics and Automation.

  • JPMorgan Equips 250,000 Employees with AI Assistants from OpenAI and Anthropic

    JPMorgan Equips 250,000 Employees with AI Assistants from OpenAI and Anthropic

    JPMorgan Chase is taking a major step in integrating artificial intelligence across its operations by providing 250,000 employees with access to LLM Suite, a platform that connects staff to large language models from OpenAI and Anthropic. The initiative aims to move beyond simple chatbots toward autonomous AI agents that can handle complex tasks across multiple business functions.

    Derek Waldron, Chief Analytics Officer at JPMorgan, described the vision as one where the bank becomes a fully AI-connected enterprise. In a demonstration, Waldron showed how the platform can create an investment banking presentation in 30 seconds—work that previously required hours from junior bankers.

    Launched in 2023, LLM Suite initially offered OpenAI’s models for drafting emails and summarizing documents. It now incorporates Anthropic’s Claude model as well. About half of the 250,000 employees with access use it daily, and the platform is updated every eight weeks with new data from the bank’s business units.

    Key capabilities include drafting confidential merger and acquisition documents, providing personalized AI assistants for every employee, automating routine back-office processes, and using AI agents to handle complex multi-step tasks autonomously.

    Waldron acknowledged that while AI will empower some workers, others face displacement as processes no longer require human involvement. In May, the head of JPMorgan’s consumer banking division told investors that operations staff would fall by at least 10% over five years due to AI deployment. Senior Wall Street executives have discussed changing the ratio of junior bankers to senior managers from 6-1 to 4-1 as AI handles more work.

    Despite the rapid deployment, Waldron noted it will take years to fully connect AI models with the bank’s data and software, which has an annual technology budget of $18 billion. An MIT report from July found that most corporations had not generated returns on AI projects despite over $30 billion in investments.

  • Huawei Deploys Specialized AI Model to Revolutionize Steel Manufacturing in China

    Huawei Deploys Specialized AI Model to Revolutionize Steel Manufacturing in China

    Huawei has taken a leading role in developing Guangxi’s first AI model tailored for the steel industry, introducing the Xuantie Steel Model to optimize production costs and efficiency through advanced neural networks.

    The Xuantie Model represents a major step forward in domain-specific artificial intelligence, being the first large language model designed specifically for the steel manufacturing sector in Guangxi, China. It was created through a collaboration between Guangxi Liuzhou Iron and Steel Group (Liuzhou Steel Group), Huawei, and China Mobile Guangxi Branch. This initiative demonstrates how general-purpose AI architectures can be fine-tuned for specialized industrial applications.

    The technical infrastructure supporting this deployment includes novel AI applications and a dedicated research facility focused on advancing machine learning capabilities within heavy manufacturing environments. Domain-specific large models like Xuantie address a critical challenge in industrial AI: adapting general-purpose architectures to understand the unique constraints, processes, and terminology of specific sectors. The steel industry presents particular technical challenges due to its complex, multi-stage production processes and the need for real-time decision-making under high-temperature, high-pressure conditions.

    Pre-training Architecture and Model Foundations

    The Xuantie Steel Model’s architecture builds upon Huawei’s Pangu foundation models through transfer learning and domain-specific pre-training. According to Li Bin, Chairman of Liuzhou Steel Group, the model implements a 20+N scenario-based model system that encompasses six critical production domains: pre-smelting, steelmaking, steel rolling, logistics, environmental protection, and safety. This modular architecture allows individual sub-models to be optimized for specific tasks while maintaining coherent integration across the production pipeline.

    The technical framework centers on three core computational paradigms: human-AI interaction systems, data processing and analysis capabilities, and manufacturing process optimization. This tripartite structure reflects current best practices in industrial AI deployment, where models must simultaneously interface with human operators, process vast quantities of sensor data, and make autonomous decisions within tightly constrained operational parameters.

    Jiang Wangcheng, Huawei’s Corporate Vice President and CEO of the Oil, Gas & Mining BU, explained during the recent launch that the development leveraged Huawei’s capabilities in AI, high-performance computing, and network connectivity to create a proprietary technological foundation. The integration of these components could enable end-to-end intelligent manufacturing, with large models deployed throughout the production chain rather than isolated to specific processes.

    Neural Networks in Production Environments

    The practical implementation of AI models within Liuzhou Steel Group’s operations reveals sophisticated applications of machine learning across multiple production stages. According to Shen Min, Vice President of Liuzhou Steel Group, 33 distinct AI models have been deployed in steelmaking processes alone. A 5G-enabled intelligent molten iron transportation system demonstrates end-to-end autonomous operation, while an intelligent scheduling model employs reinforcement learning to optimize inter-process coordination, reportedly improving productivity by 8.5%.

    Neural networks applied to basic oxygen furnaces and ladle argon blowing processes have achieved measurable improvements in product quality while reducing raw material consumption. These models likely employ predictive algorithms that analyze real-time sensor data to optimize chemical compositions and thermal profiles, reducing crude steel production costs by approximately CNY 5 (US$0.73) per metric ton.

    The intelligent refining solution represents a hybrid architecture combining mechanistic models—physics-based simulations of metallurgical processes—with AI prediction algorithms and parameter optimization. This approach could address a common limitation of pure machine learning systems: the need for explainability and physical consistency in safety-critical industrial environments. The system has reportedly reduced comprehensive steel costs by CNY 2 (US$0.29) per metric ton.

    Algorithmic Optimization and Future Developments

    Machine learning algorithms have been applied to plate and plywood assembly optimization, where production planning models have increased yield from 1% to 2%. Contract matching automation, achieving more than 90% accuracy, likely employs natural language processing and constraint satisfaction algorithms to align production capabilities with order specifications.

    Shi Mao, CEO of Huawei’s Steel & Non-ferrous BU, outlines a vision for intelligent manufacturing centered on three technical pillars: human-machine trust, multi-machine collaboration, and autonomous synergy. These concepts suggest advanced architectures where multiple AI agents coordinate across production systems with minimal human intervention while maintaining transparency and reliability.

    The Liuzhou Steel AI Research and Innovation Center could serve as a platform for continued model development and ecosystem collaboration. Plans include developing more than 10 high-level industrial agents for production lines and business domains, alongside more than 30 curated industrial datasets. These datasets could prove particularly valuable, as high-quality, domain-specific training data remains a critical bottleneck in industrial AI development.

    The Xuantie Model demonstrates how foundation models can be adapted to highly-specialized domains through careful pre-training, modular architecture design, and integration with existing industrial control systems. As the technology matures, such domain-specific large models could become increasingly prevalent across manufacturing sectors.