Machine Learning profile picture

Customize DeepSeek-R1 671b model using Amazon SageMaker HyperPod recipes – Part 2
In this post, we use the recipes to fine-tune the original DeepSeek-R1 671b parameter model. We demonstrate this through the step-by-step implementation of these recipes using both SageMaker training jobs and SageMaker HyperPod.
https://aws.amazon.com/blogs/m....achine-learning/cust

Discover the world at Altruu, The Discovery Engine

Cost-effective AI image generation with PixArt-Σ inference on AWS Trainium and AWS Inferentia
This post is the first in a series where we will run multiple diffusion transformers on Trainium and Inferentia-powered instances. In this post, we show how you can deploy PixArt-Sigma to Trainium and Inferentia-powered instances.
https://aws.amazon.com/blogs/m....achine-learning/cost

Build a financial research assistant using Amazon Q Business and Amazon QuickSight for generative AI–powered insights
In this post, we show you how Amazon Q Business can help augment your generative AI needs in all the abovementioned use cases and more by answering questions, providing summaries, generating content, and securely completing tasks based on data and information in your enterprise systems.
https://aws.amazon.com/blogs/m....achine-learning/buil

Study shows vision-language models can’t handle queries with negation words
Words like “no” and “not” can cause this popular class of AI models to fail unexpectedly in high-stakes settings, such as medical diagnosis.
https://news.mit.edu/2025/stud....y-shows-vision-langu

How Hexagon built an AI assistant using AWS generative AI services
Recognizing the transformative benefits of generative AI for enterprises, we at Hexagon’s Asset Lifecycle Intelligence division sought to enhance how users interact with our Enterprise Asset Management (EAM) products. Understanding these advantages, we partnered with AWS to embark on a journey to develop HxGN Alix, an AI-powered digital worker using AWS generative AI services. This blog post explores the strategy, development, and implementation of HxGN Alix, demonstrating how a tailored AI solution can drive efficiency and enhance user satisfaction.
https://aws.amazon.com/blogs/m....achine-learning/how-
