How Prediction Guard Delivers Trustworthy AI on Intel® Gaudi® 2 AI Accelerators
Overview
Large language models (LLMs) promise to revolutionize how enterprises operate, but making them production-ready means solving privacy risks, security vulnerabilities, and performance bottlenecks.
Not so easy.
This session focuses on how AI startup Prediction Guard addressed these challenges using the processing power of Intel® Gaudi® 2 AI accelerators in the Intel® Tiber™ Developer Cloud.1 The topics include:
- Prediction Guard’s pioneering work hosting open source LLMs such as Llama 2 and neural-chat-7B in a secure, privacy-preserving environment with filters for PII, prompt-injection attacks, toxic outputs, and factual inconsistencies.
- How Prediction Guard tuned batching, model replication, tensor shapes, and hyperparameters to double throughput and achieve industry-leading time to first token for streaming.
- Architectural insights and best practices for capitalizing on LLMs.
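To make the first bullet concrete, here is a minimal sketch of output filtering for PII. The function name, the placeholder format, and the regex patterns are illustrative assumptions, not Prediction Guard's actual implementation; a production filter would cover many more PII categories.

```python
import re

# Hypothetical PII patterns for illustration only; a real filter would
# cover far more categories (names, addresses, credit cards, etc.).
PII_PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}

def redact_pii(text: str) -> str:
    """Replace any matched PII in model output with a typed placeholder."""
    for label, pattern in PII_PATTERNS.items():
        text = pattern.sub(f"<{label.upper()}>", text)
    return text
```

A filter like this sits between the model and the caller, so sensitive strings never leave the serving environment.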
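The throughput point in the second bullet rests on batching: grouping incoming prompts so one forward pass serves several requests at once. This sketch shows the idea only; the function and parameter names are hypothetical, and real serving stacks batch dynamically rather than in fixed chunks.

```python
from typing import Callable, List

def batch_generate(prompts: List[str],
                   model_fn: Callable[[List[str]], List[str]],
                   batch_size: int = 4) -> List[str]:
    """Run model_fn over prompts in fixed-size batches.

    Batching amortizes per-call overhead across requests, trading a
    small queueing delay for higher overall throughput.
    """
    outputs: List[str] = []
    for i in range(0, len(prompts), batch_size):
        outputs.extend(model_fn(prompts[i:i + batch_size]))
    return outputs
```

In practice the batch size is tuned against latency targets such as time to first token, which is the trade-off the session discusses.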
Skill level: Expert
Featured Software
- This session showcases the Intel Tiber Developer Cloud: Learn More | Sign Up
Product and Performance Information
1. Formerly Intel® Developer Cloud.