Tuesday, September 26, 2023

Spring Boot service that reads from the AWS Glue Data Catalog

 To create a Spring Boot service that reads from the AWS Glue Data Catalog, you need to set up a few components:

  1. Spring Boot Application: Set up a Spring Boot application.
  2. AWS SDK for Glue: Add the necessary dependencies for AWS Glue.
  3. AWS Configuration: Configure the AWS credentials and region.
  4. Service Class: Create a service class to interact with the Glue Data Catalog.

Here’s a step-by-step guide:

Step 1: Set Up Your Spring Boot Application

Start by creating a new Spring Boot project. You can use Spring Initializr (https://start.spring.io/) to generate a basic Spring Boot project with the necessary dependencies.

Step 2: Add Dependencies

Add the necessary dependencies to your pom.xml file:


<dependencies> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-web</artifactId> </dependency> <dependency> <groupId>software.amazon.awssdk</groupId> <artifactId>glue</artifactId> </dependency> <dependency> <groupId>software.amazon.awssdk</groupId> <artifactId>auth</artifactId> </dependency> </dependencies>

Step 3: Configure AWS Credentials and Region

Create an application.yml or application.properties file to configure your AWS credentials and region.

yaml
aws: region: us-west-2 accessKeyId: YOUR_ACCESS_KEY_ID secretAccessKey: YOUR_SECRET_ACCESS_KEY

Step 4: Create AWS Configuration Class

Create a configuration class to set up the AWS Glue client.

java

import org.springframework.beans.factory.annotation.Value; import org.springframework.context.annotation.Bean; import org.springframework.context.annotation.Configuration; import software.amazon.awssdk.auth.credentials.AwsBasicCredentials; import software.amazon.awssdk.auth.credentials.StaticCredentialsProvider; import software.amazon.awssdk.regions.Region; import software.amazon.awssdk.services.glue.GlueClient; @Configuration public class AwsConfig { @Value("${aws.accessKeyId}") private String accessKeyId; @Value("${aws.secretAccessKey}") private String secretAccessKey; @Value("${aws.region}") private String region; @Bean public GlueClient glueClient() { return GlueClient.builder() .region(Region.of(region)) .credentialsProvider(StaticCredentialsProvider.create(AwsBasicCredentials.create(accessKeyId, secretAccessKey))) .build(); } }

Step 5: Create a Service Class

Create a service class to interact with the Glue Data Catalog.

java
import org.springframework.beans.factory.annotation.Autowired; import org.springframework.stereotype.Service; import software.amazon.awssdk.services.glue.GlueClient; import software.amazon.awssdk.services.glue.model.GetDatabasesRequest; import software.amazon.awssdk.services.glue.model.GetDatabasesResponse; @Service public class GlueService { private final GlueClient glueClient; @Autowired public GlueService(GlueClient glueClient) { this.glueClient = glueClient; } public GetDatabasesResponse getDatabases() { GetDatabasesRequest request = GetDatabasesRequest.builder().build(); return glueClient.getDatabases(request); } }

Step 6: Create a Controller Class

Create a controller class to expose an endpoint for the service.


import org.springframework.beans.factory.annotation.Autowired; import org.springframework.web.bind.annotation.GetMapping; import org.springframework.web.bind.annotation.RequestMapping; import org.springframework.web.bind.annotation.RestController; import software.amazon.awssdk.services.glue.model.GetDatabasesResponse; @RestController @RequestMapping("/glue") public class GlueController { private final GlueService glueService; @Autowired public GlueController(GlueService glueService) { this.glueService = glueService; } @GetMapping("/databases") public GetDatabasesResponse getDatabases() { return glueService.getDatabases(); } }


PLEASE NOTE: if you do not have access to AWS accessKeyId and secretAccessKey,. you can get an
instance of glueClient using following code snippet

GlueClient glueClient = GlueClient.builder() .region(region) .build();


To access a cross account DB, please use the folloiwng Example
SELECT statement:

SELECT * FROM "glue:arn:aws:glue:us-east-1:999999999999:catalog".tpch1000.customer

Step 7: Run the Application

Run your Spring Boot application. You can access the Glue Data Catalog databases by navigating to http://localhost:8080/glue/databases.

This setup provides a basic Spring Boot service that reads from the AWS Glue Data Catalog. You can extend this to handle more Glue operations as needed

Use SSH Keys to clone GIT Repository using SSH

  1. Generate a New SSH Key Pair bash ssh-keygen -t rsa -b 4096 -C "HSingh@MindTelligent.com" -t rsa specifies the type of key (...