aws-samples

Github Data

Followers 15259

Following 0

Links

https://amazon.com/aws

AI Project

Public repos: 7217Public gists: 0

sample-serverless-llama-server

Serverless LLM Inference: Deploy DeepSeek R1 & LLaMA Models on AWS Lambda with Ultra-Fast Cold Starts

language: Rust

created at: 2025-03-15

updated at: 2025-03-17

sample-chatbot-lambda-snapstart

Serverless DeepSeek R1 Inference with FastAPI and Lambda SnapStart

language: Python

created at: 2025-03-07

updated at: 2025-06-10

bedrock-engineer

Universal AI agent building apps using Amazon Bedrock, capable of customize to create/edit files, execute commands, search the web, use knowledge base, use multi-agents, generative images and more.

star: 347fork: 44

language: TypeScript

created at: 2025-02-07

updated at: 2025-07-03

easy-model-deployer

A user-friendly Command-line/SDK tool that makes it quickly and easier to deploy open-source LLMs on AWS

star: 53fork: 9

language: Python

created at: 2025-01-25

updated at: 2025-06-26

nki-llama

star: 12fork: 12

language: Python

created at: 2025-01-21

updated at: 2025-03-26

swift-chat

A lightning-fast, cross-platform AI chat application built with React Native.

star: 445fork: 46

language: TypeScript

created at: 2024-11-06

updated at: 2025-05-26

fine-tune-qwen2-vl-with-llama-factory

star: 12fork: 2

language: Jupyter Notebook

created at: 2024-10-17

updated at: 2025-03-06

jira-ticket-classification

A Python-based AWS solution for automated Jira ticket classification using Amazon Bedrock. This project helps Jira users automate ticket categorization featuring S3 integration, AWS Glue deduplication, LLMs, and Terraform deployment.

language: Python

created at: 2024-10-04

updated at: 2024-11-28

sagemaker-hosted-stable-video-diffusion-img2vid-xt

language: Jupyter Notebook

created at: 2024-08-09

updated at: 2025-06-18

deploy-langfuse-on-ecs-with-fargate

Self-hosting Langfuse on Amazon ECS with Fargate using CDK Python

star: 34fork: 6

language: Python

created at: 2024-08-01

updated at: 2025-02-09

failure-analysis-assistant

Failure Analysis Assistant (FA2) is a sample that supports failure analysis when a failure occurs using LLM in Amazon Bedrock. It was shown as an exhibition demo for AWS Summit Japan 2024.

star: 18fork: 4

language: TypeScript

created at: 2024-07-17

updated at: 2025-01-28

function-calling-using-amazon-bedrock-anthropic-claude-3

Function calling using Amazon Bedrock with Anthropic Claude 3 foundation model

star: 26fork: 7

language: Python

created at: 2024-07-07

updated at: 2025-01-20

finetune-sdxl-on-sagemaker-and-host-on-infr2

language: Jupyter Notebook

created at: 2024-05-28

updated at: 2024-08-21

Meta-Llama-on-AWS

star: 93fork: 23

language: Jupyter Notebook

created at: 2024-05-20

updated at: 2025-06-19

comfyui-on-amazon-sagemaker

This project demonstrates how to generate images using Stable Diffusion or FLUX.1 models by hosting ComfyUI on Amazon SageMaker inference endpoint.

star: 44fork: 11

language: Python

created at: 2024-05-16

updated at: 2025-02-07

whats-new-summary-notifier

A generative AI application that summarizes the content of AWS What's New and other web articles in multiple languages, and delivers the summary to Slack or Microsoft Teams.

star: 41fork: 5

language: Python

created at: 2024-05-07

updated at: 2025-01-30

qa-app-with-rag-using-amazon-bedrock-and-kendra

Question Answering Generative AI application with Large Language Models and RAG powered by Amazon Bedrock and Amazon Kendra

language: Python

created at: 2024-05-07

updated at: 2024-12-03

sample-chatbot-for-bedrock-knowledge-base-and-multimodal-llms

Multimodal Chatbot with Amazon Bedrock Knowledge Bases Integration

star: 14fork: 2

language: Python

created at: 2024-05-03

updated at: 2025-01-21

foundational-llm-chat

Chainlit application built using AWS CDK, secured with Amazon Cognito, that allows you to interact with Anthropic's Claude language models from Amazon Bedrock.

star: 13fork: 4

language: TypeScript

created at: 2024-05-02

updated at: 2024-12-31

async-stable-diffusion-image-api

language: Python

created at: 2024-04-17

updated at: 2024-07-16

build-an-agentic-llm-assistant

Labs for the "Build an agentic LLM assistant on AWS" workshop. A step by step agentic llm assistant development workshop using serverless three-tier architecture.

star: 52fork: 18

language: Jupyter Notebook

created at: 2024-04-11

updated at: 2025-02-03

rag-with-amazon-bedrock-and-opensearch

Opinionated sample on how to build and deploy a RAG application with Amazon Bedrock and OpenSearch

star: 28fork: 1

language: Python

created at: 2024-03-28

updated at: 2025-02-09

generative-bi-using-rag

A solution guidance for Generative BI using Amazon Bedrock, Amazon OpenSearch with RAG

star: 134fork: 42

language: Python

created at: 2024-03-12

updated at: 2025-02-06

rag-with-amazon-opensearch-and-sagemaker

Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service

star: 22fork: 3

language: Python

created at: 2024-02-21

updated at: 2025-01-28

amazon-bedrock-rag

Fully managed RAG solution implemented using Knowledge Bases for Amazon Bedrock

star: 87fork: 24

language: JavaScript

created at: 2024-02-19

updated at: 2025-02-08

rag-with-amazon-bedrock-and-documentdb

Question Answering Generative AI application with Large Language Models (LLMs), Amazon Bedrock, and Amazon DocumentDB (with MongoDB Compatibility)

language: Jupyter Notebook

created at: 2024-02-16

updated at: 2024-12-03

conversational-ai-assistant-multi-route-chain

This GitHub repository guides you through building an advanced Conversational AI assistant using AWS services and Anthropic's Claude V2 model. It features intelligent routing to relevant functions, database querying, semantic searches, Lambda function executions, and specialized interactions.

star: 28fork: 8

language: Python

created at: 2024-02-09

updated at: 2025-02-01

rag-with-amazon-bedrock-and-memorydb

Question Answering Generative AI application with Large Language Models (LLMs), Amazon Bedrock, and Amazon MemoryDB for Redis

star: 10fork: 1

language: Jupyter Notebook

created at: 2024-02-03

updated at: 2025-02-06

text-to-sql-bedrock-workshop

This repository is intended for those looking to dive deep on advanced Text-to-SQL concepts.

star: 100fork: 21

language: Jupyter Notebook

created at: 2024-01-31

updated at: 2025-02-07

rag-with-amazon-bedrock-and-pgvector

Opinionated sample on how to build/deploy a RAG web app on AWS powered by Amazon Bedrock and PGVector (on Amazon RDS)

star: 71fork: 16

language: Python

created at: 2024-01-22

updated at: 2025-01-31

foundation-model-benchmarking-tool

Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stack options.

star: 237fork: 41

language: Jupyter Notebook

created at: 2024-01-09

updated at: 2025-04-11

rag-with-amazon-postgresql-using-pgvector-and-sagemaker

Question Answering application with Large Language Models (LLMs) and Amazon Postgresql using pgvector

star: 14fork: 2

language: Python

created at: 2024-01-06

updated at: 2025-01-05

rag-with-amazon-opensearch-serverless-and-sagemaker

Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Serverless Service

language: Python

created at: 2024-01-06

updated at: 2024-12-03

multimodal-rag-on-slide-decks

'Talk to your slide deck' (Multimodal RAG) using foundation models (FMs) hosted on Amazon Bedrock and Amazon SageMaker

star: 36fork: 6

language: HTML

created at: 2023-12-19

updated at: 2025-01-30

text-embeddings-pipeline-for-rag

A pipeline to convert contextual knowledge stored in documents and databases into text embeddings, and store them in a vector store

star: 15fork: 1

language: TypeScript

created at: 2023-11-28

updated at: 2025-01-07

aws-agentic-document-assistant

An agent based LLM assistant that extends RAG with batch entity extraction and SQL querying to improve performance on multi-step and analytical questions.

star: 66fork: 24

language: Jupyter Notebook

created at: 2023-11-13

updated at: 2025-02-09

amazon-sagemaker-llama2-prompting-best-practices

Best practices for prompting for Meta's Llama2 Large Language Model using Amazon Sagemaker

language: Jupyter Notebook

created at: 2023-11-02

updated at: 2025-01-09

amazon-sagemaker-llama2-response-streaming-recipes

Amazon SageMaker Llama 2 Inference via Response Streaming

star: 13fork: 4

language: Jupyter Notebook

created at: 2023-10-11

updated at: 2024-09-08

bedrock-kb-rag-workshop

Bedrock Knowledge Base and Agents for Retrieval Augmented Generation (RAG)

star: 50fork: 11

language: HTML

created at: 2023-10-02

updated at: 2024-12-10

awsome-distributed-training

Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.

star: 239fork: 95

language: Python

created at: 2023-09-30

updated at: 2025-02-06

serverless-pdf-chat

LLM-powered document chat using Amazon Bedrock and AWS Serverless

star: 255fork: 242

language: TypeScript

created at: 2023-09-30

updated at: 2025-02-01

serverless-rag-demo

Amazon Bedrock Foundation models with Amazon Opensearch Serverless as a Vector DB

star: 167fork: 49

language: Python

created at: 2023-08-23

updated at: 2025-02-05

bedrock-claude-chat

AWS-native chatbot using Bedrock + Claude (+Nova and Mistral)

star: 1.0Kfork: 371

language: TypeScript

created at: 2023-08-22

updated at: 2025-02-10

rag-using-langchain-amazon-bedrock-and-opensearch

RAG with langchain using Amazon Bedrock and Amazon OpenSearch

star: 205fork: 39

language: Python

created at: 2023-08-17

updated at: 2025-02-04

generative-ai-use-cases

Application implementation with business use cases for safely utilizing generative AI in business operations

star: 1.1Kfork: 283

language: TypeScript

created at: 2023-08-17

updated at: 2025-07-03

amazon-bedrock-samples

This repository contains examples for customers to get started using the Amazon Bedrock Service. This contains examples for all available foundational models

star: 821fork: 365

language: Jupyter Notebook

created at: 2023-07-05

updated at: 2025-02-10

aws-genai-llm-chatbot

A modular and comprehensive solution to deploy a Multi-LLM and Multi-RAG powered chatbot (Amazon Bedrock, Anthropic, HuggingFace, OpenAI, Meta, AI21, Cohere, Mistral) using AWS CDK on AWS

star: 1.2Kfork: 347

language: TypeScript

created at: 2023-06-16

updated at: 2025-02-09

private-llm-qa-bot

star: 263fork: 64

language: Jupyter Notebook

created at: 2023-05-18

updated at: 2025-01-16

llm-apps-workshop

Use LLMs for building real-world apps

star: 111fork: 38

language: HTML

created at: 2023-04-19

updated at: 2025-01-28

lm-gvp

LM-GVP: A Generalizable Deep Learning Framework for Protein Property Prediction from Sequence and Structure

star: 52fork: 7

language: Jupyter Notebook

created at: 2021-08-18

updated at: 2025-02-06

coldstart-recs-on-aws-trainium

End-to-end solution for cold-start recommendations using vLLM, DeepSeek Llama (8B & 70B), and FAISS on AWS Trainium (Trn1) with the Neuron SDK and NeuronX Distributed. Includes LLM-based interest expansion, embedding comparisons (T5 & SentenceTransformers), and scalable retrieval workflows.

language: Jupyter Notebook

created at: 2020-06-23

updated at: 2025-04-22

scalable-hw-agnostic-inference

A hardware-agnostic (NVIDIA's GPUs and AWS Inferentia accelerators) deployment of computer-vision models (e.g., YOLO, ViT), generate text and text-to-image (e.g., Llama3 and Stable Diffusion ) on EKS controlled by K8s ingress in routing-time and Karpenter in scheduling-time that is scaled by KEDA.

star: 23fork: 11

language: Python

created at: 2019-11-05

updated at: 2025-07-01

aws-kr-startup-samples

A collection of AWS Korea Startup SA team materials for hands-on labs.

star: 19fork: 8

language: Jupyter Notebook

created at: 2019-02-19

updated at: 2025-02-09