NVIDIA Demos Personalized Chatbot with Local AI Acceleration
- Chat with RTX is a free tech demo that lets users personalize a chatbot with their own content, accelerated by an NVIDIA RTX GPU.
- It uses retrieval-augmented generation and TensorRT-LLM software to bring generative AI capabilities to local PCs.
- Users can connect files or YouTube videos to the chatbot as a dataset for quick, contextual answers to queries.
- It runs locally on the user's device so responses are fast and data stays private.
- Developers can use the TensorRT-LLM RAG reference project on GitHub to build their own RTX-accelerated apps.
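
To make the retrieval-augmented generation idea above concrete, here is a minimal sketch of the retrieval step only. It is not how Chat with RTX or the TensorRT-LLM reference project is implemented: it assumes a toy word-count similarity in place of a real embedding model, it stops short of calling any language model, and every name in it (`tokenize`, `cosine`, `retrieve`, `build_prompt`, `NOTES`) is hypothetical and chosen for illustration.

```python
import math
import re
from collections import Counter

def tokenize(text):
    # Lowercase and split into alphanumeric words.
    return re.findall(r"[a-z0-9]+", text.lower())

def cosine(a, b):
    # Cosine similarity between two word-count vectors (Counters).
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def retrieve(query, documents, k=1):
    # Rank the user's documents by similarity to the query and
    # return up to k of them, skipping documents with no overlap.
    q = Counter(tokenize(query))
    scored = [(cosine(q, Counter(tokenize(d))), d) for d in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [d for score, d in scored[:k] if score > 0]

def build_prompt(query, documents, k=1):
    # Prepend the retrieved text to the question; in a full pipeline
    # this prompt would then be passed to a locally running model.
    context = "\n".join(retrieve(query, documents, k))
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"

# Hypothetical stand-in for a user's local files.
NOTES = [
    "The quarterly budget review is scheduled for Friday at 10am.",
    "Our favorite restaurant in Las Vegas was the noodle bar near the hotel.",
    "The GPU driver update fixed the rendering crash.",
]

if __name__ == "__main__":
    print(build_prompt("Which restaurant did we like in Las Vegas?", NOTES))
```

Because the documents and the retrieval both stay in the local process, nothing in this sketch leaves the machine, which mirrors the privacy point made in the summary. A production system would typically swap the word-count vectors for learned embeddings and a vector index.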