AWS Big Data profile picture

Optimize data layout by bucketing with Amazon Athena and AWS Glue to accelerate downstream queries
In this post, we discuss how to implement bucketing on AWS data lakes, including with the Athena CTAS (CREATE TABLE AS SELECT) statement and AWS Glue for Apache Spark. We also cover bucketing for Apache Iceberg tables.
https://aws.amazon.com/blogs/b....ig-data/optimize-dat

Discover the world at Altruu, The Discovery Engine

Orchestrate an end-to-end ETL pipeline using Amazon S3, AWS Glue, and Amazon Redshift Serverless with Amazon MWAA
Amazon Managed Workflows for Apache Airflow (Amazon MWAA) is a managed orchestration service for Apache Airflow that you can use to set up and operate data pipelines in the cloud at scale. Apache Airflow is an open source tool used to programmatically author, schedule, and monitor sequences of processes and tasks, referred to as workflows. […]
https://aws.amazon.com/blogs/b....ig-data/orchestrate-

2 days ago

Run interactive workloads on Amazon EMR Serverless from Amazon EMR Studio
Starting from release 6.14, Amazon EMR Studio supports interactive analytics on Amazon EMR Serverless. You can now use EMR Serverless applications as the compute, in addition to Amazon EMR on EC2 clusters and Amazon EMR on EKS virtual clusters, to run JupyterLab notebooks from EMR Studio Workspaces. EMR Studio is an integrated development environment (IDE) […]
https://aws.amazon.com/blogs/b....ig-data/run-interact


4 days ago

Dynamic DAG generation with YAML and DAG Factory in Amazon MWAA
Amazon Managed Workflows for Apache Airflow (Amazon MWAA) is a managed service that allows you to use a familiar Apache Airflow environment with improved scalability, availability, and security to enhance and scale your business workflows without the operational burden of managing the underlying infrastructure. In Airflow, Directed Acyclic Graphs (DAGs) are defined as Python code. […]
https://aws.amazon.com/blogs/b....ig-data/dynamic-dag-

8 days ago

How Salesforce optimized their detection and response platform using AWS managed services
Headquartered in San Francisco, Salesforce, Inc. is a cloud-based customer relationship management (CRM) software company building artificial intelligence (AI)-powered business applications that allow businesses to connect with their customers in new and personalized ways. In this post, we discuss how the Salesforce TIP team optimized their architecture using Amazon Web Services (AWS) managed services to achieve better scalability, cost, and operational efficiency.
https://aws.amazon.com/blogs/b....ig-data/how-salesfor
