Amazon has named its newest AI assistant Rufus. But what’s the story behind this distinctive name, and how does it stack up against the titles chosen for other AIs?
In Ancient Rome, it was a nickname given to those with red hair. The ruddy complexion of William II of England, who was the third son of William the Conqueror, also earned him this moniker. Nowadays in the UK, the name “Rufus” is often associated with nobility. However, it is perhaps more frequently heard being called out by dog owners while walking their pets due to its charming similarity to barking.
Consequently, it might come as a surprise that the large online retailer Amazon has chosen the same name for its latest AI assistant. In November 2024, it will mark a decade since Amazon introduced its first voice-activated smart assistant, Alexa. The name was reportedly chosen as a tribute to the ancient Library of Alexandria in Egypt.
However, the origins of the new assistant’s name are arguably more endearing—Rufus is named after Amazon’s first “office dog.”
Rufus joins an expanding roster of AI assistants created by major tech companies that have rather obscure names. Initially, Google’s Gemini was called “Titan,” but this was changed by the DeepMind team working on it. Gemini translates from Latin as twins, highlighting the collaboration between the DeepMind and Google Research teams behind the initiative, as well as the dual nature commonly associated with the Gemini zodiac sign.
The recently introduced Apple Intelligence—which incorporates AI features into Apple’s devices—has a name that is quite straightforward. However, the name of Apple’s Siri digital assistant was chosen by Dag Kittlaus, the co-founder of the company that originally developed the voice-activated software. He named it after a Norwegian colleague, with its meaning in Norwegian being “beautiful woman who leads you to victory,” but it was also selected because it was easy to pronounce, according to Kittlaus.
Stories about Rufus, a delightful Welsh corgi, trace back to the earliest days of Amazon.
In 1996, amid the dot-com boom and shortly after Susan and Eric Benson started working at the then-two-year-old startup, there were fewer than 20 employees at Amazon, and Eric was the fifth engineer hired. Susan, a journalist, would become the editorial voice of the pioneering online retailer. As they worked long hours in preparation for the company’s public offering, they began bringing their two-year-old dog to the office.
It didn’t take long for the dog to become part of the workday: chasing tennis balls in hallways, begging for treats from colleagues, snoozing through meetings under tables, and using his paw (with a little help from humans) to activate features on the website, like the algorithm recommending books based on others’ preferences.
Rufus eventually became the face of Amazon’s error page to indicate when something went wrong.
Even after the Bensons retired in 2001, Rufus continued to be seen on campus with his dog sitter, who was still employed there. The dog lived to the age of 15 and helped establish the pet-friendly culture at Amazon, allowing employees to bring their animals to work.
Today, Amazon boasts over 10,000 dogs registered as “working” within the company. The various campuses around the world feature enticing amenities, including well-equipped dog parks, a bounty of treats, and gatherings for furry friends.
Amazon is not the only tech firm to adopt a dog-friendly environment. Google permits dogs in its offices, and their employees who own dogs are affectionately known as Dooglers. The Mountain View campus even houses a dog park called the Doogleplex.
Additionally, there is an office building in downtown Seattle named after Rufus, the first dog of Amazon, and it is where the AI-powered shopping assistant was developed, according to Rajiv Mehta, Amazon’s vice president of search and conversational shopping.
“We conducted extensive research on the name Rufus across many of Amazon’s markets and found that it was a brand and name that resonated well with customers globally,” Mehta states.
Though Rufus passed away in 2009, his memory endures. Amazon has recently launched its AI assistant in the U.S., India, and the U.K. to assist customers with their purchasing choices. It aims to respond to queries about products.
While customers can interact with Rufus, the AI assistant currently lacks a voice and can only reply with text and images. Mehta refrained from speculating on whether it would be integrated with Alexa. “I can’t comment on plans regarding Alexa,” he notes. “The teams are completely separate at this stage.”
So far, Rufus has successfully answered over 100 million queries, according to Mehta
When we asked Rufus for more information about dogs, its response was as heartwarming as the original Rufus:
Canines are among the most cherished pets globally. They are faithful, amiable, and make excellent companions. Regardless of their variations, all dogs possess a few fundamental characteristics—they are loving, protective of their owners, and keen to satisfy. Dogs need appropriate care, which includes consistent exercise, grooming, and veterinary visits. In exchange, they offer unconditional affection and can enhance their owner’s physical and mental well-being. Whether as a household pet or a working creature, dogs have played a crucial role in human society for thousands of years.
Although we can’t determine how satisfied the original Rufus would have felt with this summary, he likely would have lamented the absence of treats.
Amazon Prime Day has become a hotspot for shoppers eager to snag fantastic deals. The discounts can be significant—sometimes exceeding 40 percent off a product—but without knowledge of a product’s pricing history, it’s difficult to ascertain if a deal is truly worth it. Shoppers have traditionally relied on third-party price tracking services to bridge this gap, but now Amazon claims it will allow select users to access this data through a quick conversation with its AI shopping assistant, Rufus.
For instance, consider this week’s promotion on LifeStraw’s personal water filter. As part of Prime Big Deal Days, the filter was priced at $10, which was half of its original cost of $20, making it seem like quite the steal. However, was it genuinely a good deal, or had the price been even cheaper? Until recently, the company wouldn’t disclose this information. But in recent weeks, it has subtly commenced testing increased transparency. Accessing this information involves simply conversing with Rufus.
Introduced to some users in February and made available to everyone in the U.S. in July, Rufus is Amazon’s shopping-oriented response to ChatGPT. It is accessible by clicking the speech bubble icon located at the bottom right of Amazon’s app or the top left of its website’s navigation.
Some initial feedback criticized it as unreliable and only somewhat helpful. Rajiv Mehta, Amazon’s vice president for search and conversational shopping, shared in a blog post last month that users have been inundating Rufus with inquiries regarding product specifications, buying suggestions, and comparisons between items. Rufus can also respond to questions about orders or even the meaning of life.
What Mehta neglected to mention was Rufus’ ability to provide price history. By navigating to a product page, tapping Rufus, and asking “price history,” users can obtain valuable information. In the case of the LifeStraw filter, Rufus indicated during this week’s sale, “This is the lowest price on Amazon in the past 30 days.”
In the Amazon app, Rufus also displayed a line graph illustrating the filter’s price fluctuations over the previous month. It showed that the price remained below $20 throughout the entire period and dropped as low as around $14 for several days. Thus, while the offer price was indeed a bargain, it may not have been as significant a discount as advertised during Prime Big Deal Days.
An Amazon representative, Janelle Rasey, mentioned that revealing pricing history is an experimental feature currently available to a limited group of U.S. users. “We strive to enhance customers’ lives and simplify their experiences daily, including assisting them in finding and discovering anything they might want, enabling informed purchasing decisions in our store,” she stated. “We routinely test new features to help customers secure excellent value across our broad selection.”
If Amazon’s trial of sharing price history through Rufus expands and remains, it could be a compelling reason for users to give the chatbot a chance. Trishul Chilimbi, an Amazon vice president overseeing research, noted last week that his teams trained Rufus on all the products, reviews, and Q&A contributions on the company’s site, as well as some publicly available data elsewhere online. In essence, Rufus facilitates easier access to information that a user might otherwise have to gather themselves.
However, subtle or behind-the-scenes data, such as price shifts, are more challenging to acquire. In the case of the LifeStraw filter, popular price tracking tools CamelCamelCamel and Glass It lacked any data when WIRED explored them. Another service, Keepa, provided data dating back to 2017, showing a record-low price of $8 in 2022.
Executives from Keepa and Glass It informed WIRED that they are not worried about competition from Rufus. They assert that their data is more extensive and supports a variety of tools, including price alerts. “Amazon’s efforts to provide price history data directly to users is beneficial for all of us as consumers who seek to make informed purchasing choices,” remarked Amor Avhad, Glass It’s founder.
Amazon has faced criticism for a lack of transparency in various aspects of its operations. In two ongoing lawsuits, the U.S. Federal Trade Commission has separately accused Amazon of deceptive and anticompetitive practices that have obscured details about subscription renewals and sales algorithms for shoppers and sellers alike. However, regarding product pricing, Amazon has, in some respects, been open with consumers.
Customers who leave an item in their cart for a while are notified by Amazon if there has been a price change—up or down—by even a single cent since they added it. If Amazon believes that its pricing isn’t competitive with other retailers, it may hide the Buy button and require customers to take extra steps to finalize their purchase.
The impact of price history access on merchants caught in the middle remains to be seen. Tristan Månsson-Perrone of Radius Outfitters, an Amazon seller whose tool roll was featured in recent deals, mentions that he does not frequently change his pricing. Thus, customers may not find much insight from asking Rufus, he notes.
Overall, Amazon has stressed that it wants Rufus—named after a corgi from the company’s first office—to be a reliable companion. When asked to summarize reviews, it points out the advantages and disadvantages. It recommends products from outside Amazon and avoids coming across as overly promotional.
However, WIRED encountered difficulties getting Rufus to assist with ethical shopping inquiries, like which brands were backing certain sides in conflicts or elections. There is also ongoing uncertainty about whether tools like Rufus will diminish the revenue of the professional reviews industry, including WIRED itself. These limitations and concerns felt secondary when Rufus seemed like an unoriginal copycat. With exclusive pricing information, it might begin to transform into a shopper’s best ally.
“Rufus is created to assist customers in saving time and making more informed purchasing decisions by addressing questions on a wide range of shopping needs and products right within the Amazon Shopping app,” Amazon stated in a blog post announcing the chatbot’s broader availability. “It’s akin to having a shopping assistant with you whenever you’re in our store.”
Amazon initially introduced Rufus in February but had only made it available to a select group of users in the app until this point. It has been trained using Amazon’s vast product database, customer reviews, community Q&As, and information sourced from the internet, being all-knowing when it comes to shopping.
You can inquire about product specifics (like whether this shirt is machine-washable) or what other customers think about it. You can ask the chatbot for tailored merchandise suggestions and category comparisons: “Compare OLED and QLED TVs” or “distinguish trail shoes from running shoes.” You can also ask Rufus why a particular type of product might be beneficial. It can inform you about your order’s arrival time and your previous purchases of favorite items like socks or sunscreen.
In an unexpected twist for a shopping chatbot, Rufus—which is inspired by the charming Welsh corgi owned by two early employees who frequently brought their dog to the office—can also respond to general inquiries, ranging from political matters to philosophical themes. A chatbot that can suggest which mop to buy, link to that item, and address existential questions? I was eager to give it a try.
I opened the Amazing Shopping app and tapped the small orange and teal icon at the bottom right corner, signaling that Rufus was ready to engage. Anyone who has interacted with a customer service chatbot is familiar with the process: Type or use voice dictation to ask a question (I chose to type), and receive a response.
Initially, I asked Rufus which product is ideal for cleaning hardwood floors, and I promptly received advice on considerations to keep in mind (“look for pH-neutral cleaners that will not damage the floor’s finish”). Rufus also provided five specific floor-cleaning products, complete with links to their Amazon shopping pages, of course. Rufus even proposed additional questions I might want to explore, such as “How often should hardwood floors be cleaned?”
I could have gotten similar answers from Google or ChatGPT, and I did when I posed the same questions to both. However, for shoppers inclined to make purchases on Amazon, Rufus simplifies the path from research to purchase.
Rufus On The Meaning Of Life
The fact that Rufus can engage with non-shopping topics as well indicates that Amazon is positioning the product as both a shopping facilitator and a competitor to Google/ChatGPT, aiming to keep consumers engaged within the Amazon ecosystem for longer. (Amazon did not provide comments on that hypothesis, only saying that Rufus can support customers at any phase of their shopping journey.)
After clarifying my hardwood floor questions, I proceeded to see if Amazon’s AI assistant could help with more profound issues. I asked it about the meaning of life, doing so twice for consistency.
Rufus recognized that I had posed a timeless question pondered by philosophers, theologians, and intellectuals throughout history. It then outlined and elaborated on elements typically believed to contribute to a meaningful life: seeking fulfillment, participating in something greater, and living ethically. It suggested other questions I could explore further, such as “How can one effectively research reputable charities?”
The second time I inquired about the meaning of life, I noticed—given that this is a shopping site—it provided Amazon search links to philosophy, spiritual, and self-help books, as well as meditation supplies and yoga mats. I appreciated that Rufus remained focused on the topic at hand and didn’t reference my decidedly non-spiritual shopping history while we ventured into discussions beyond household items. Links to curtain rods during such a serious conversation would have felt uncomfortable.
It’s still early in Rufus’s development, but hopefully, it will keep that level of decorum. Amazon indicates that it will enhance Rufus based on customer feedback, so I plan to return and observe its growth as an AI philosopher.
Amazon has marketed its shopping AI chatbot Rufus as a remedy for individuals feeling overwhelmed by the extensive selection of products available on its platform. However, since it’s Amazon, this will now inevitably include advertisements, as noted first by Adweek. Rufus (named after a pet corgi belonging to early Amazon employees) utilizes AI to research products and suggest purchases through conversational interactions.
“To assist customers in uncovering more products with Amazon’s AI-driven shopping assistant, known as Rufus, your ads may appear in placements related to Rufus,” an update to advertisers clarifies. “Rufus might generate accompanying text based on the conversation’s context.”
Rufus provides results based on Amazon’s extensive product catalog, customer reviews, and community Q&As. In some aspects, the advertising serves merely as another category of information. The update aligns it more closely with how the traditional Amazon shopping search operates. Rather than presenting ‘sponsored’ suggestions as links on the page, Rufus will directly foreground advertised products as it responds to your inquiries.
Clearly, Amazon aims not to inundate Rufus users with irrelevant ads, hence the mention of “context.” Therefore, when you request Rufus compare different items or seek gift suggestions, you won’t receive unrelated recommendations. Rather, any product that an advertiser has sponsored will likely be included in the comparison or highlighted among the initial collection of gift suggestions.
Rufus remains largely an experiment, and Amazon has cautioned that its responses may not always be accurate. What implications this has for sponsored products is uncertain, but presumably, Amazon seeks to avoid errors that could negatively impact the ads it presents for its advertisers.
AI Marketing Professionals
Rufus is not the first to integrate advertising within its AI framework. Microsoft started experimenting with advertisements through its Copilot AI chatbot a year prior. Similarly, the AI conversational search platform Perplexity has begun to feature sponsored suggestions in its search results, resembling Google’s business model more closely.
However, Amazon is the dominant player in e-commerce, and other platforms will likely observe its progress with interest. If Rufus proves beneficial for advertisers on Amazon, it’s certain that competitors will quickly follow suit if they haven’t already. While the advertisements may just serve as a revenue source for Amazon, Rufus could signify the next evolution in online advertising.
Rufus AI is powered by a large language model (LLM) that is specifically tailored for shopping. It can address questions related to buying needs, product comparisons, and durability, as well as offer suggestions shaped by the conversational context. In the most recent version of the Amazon app, users in the U.S. can access Rufus by tapping the Rufus icon located in the bottom navigation bar. This opens a chat window where they can pose questions or select suggested queries to initiate a conversation.
For instance, a user might ask, “Is this coffee maker easy to clean and maintain?” and then click on related questions presented by Rufus AI in the chat interface to gain further insights about the product. They may also click on “What do customers say?” to obtain a quick summary of reviews from prior buyers. Additionally, users can monitor their packages, ask about the delivery time of current orders, and help find past orders through Rufus.
In addition to product suggestions and comparisons, Amazon Rufus AI can aid users in staying informed about fashion trends or the latest technological advancements, like new product models or popular designs. When users ask about products suitable for specific geographical locations, Rufus can offer local weather, humidity, and other pertinent information.
Rufus AI was first introduced in beta in February 2024. According to Amazon, users have already posed “tens of millions” of specific product-related questions to Rufus, which has provided responses drawn from detailed product information, reviews, and community Q&A data. Amazon has indicated that Rufus will continue to evolve over time.
Conclusion
The debut of Amazon’s Rufus AI represents a significant advancement in improving the online shopping experience. With its capability to deliver detailed product information, answer specific user questions, and provide tailored recommendations, Rufus is poised to transform how customers engage with the Amazon platform. As Amazon works to refine and enhance this groundbreaking tool, users can anticipate an even more seamless and informed shopping experience.
If you’re acquainted with AI chatbots like ChatGPT or Gemini, Rufus operates similarly. It’s trained on “Amazon’s vast product catalog, customer reviews, community Q&As, and data from various web sources,” which it processes, links together, and condenses to answer your inquiries.
These bots don’t merely copy and paste; they aim to create new and original replies, so, as always with these AI models, errors can occur. Amazon notes that Rufus “may not always get things right,” so it’s advisable to verify important information—don’t assume everything Rufus provides is entirely accurate, and avoid sharing personal or sensitive information with it.
If you’re accessing the Amazon Shopping app on Android or iOS, Rufus can be found on the right side of the navigation bar at the bottom (the Rufus symbol consists of two blobs with a star next to them): Tap the icon to open a chat window and start posing questions. When shopping on the Amazon website, the Rufus button appears on the left side of the toolbar at the top.
Immediately, Rufus will offer suggestions for questions to ask—some of which might relate to recent searches or purchases. You can click or tap on any suggested questions or type in your own inquiry, and Rufus will spring into action.
After each answer, you have the chance to ask follow-up questions (Rufus retains the conversation history), and you can rate the responses you received (using the thumbs up or thumbs down icons). You are able to clear your chat history in Rufus, but this is only possible within the app, not on the web: Tap on the three dots (top right), select Manage chat, and then Clear chat history.
Rufus is aware of the product you’re viewing on the Amazon platform, so you can inquire about a product on your screen. For instance, you might ask how much an item weighs, the materials used to make it, or its battery life. In certain instances, the bot will reference details in the product listing.
Beyond individual product pages, Rufus can handle shopping-related questions in a broader context. You might want to know what the best tool is for a specific task, how two items compare, or what a particular item does (and how). You can also ask about prevailing trends in product categories, and discover what’s trending among other shoppers, for example.
You can ask quite imaginative questions. For instance, you might inquire about the tools and materials required to build a garden shed, gift ideas suitable for a 5-year-old’s birthday, or additional supplies you might need for a dinner gathering. Rufus will try to offer you some helpful suggestions.
This is where Rufus shares similarities with other generative AI chatbots: If you’re seeking shopping ideas or are unsure about how to compare two types of TV technology, you can get insights. The questions you pose don’t necessarily have to be exclusively about items sold on Amazon, but you’re likely to see links to related products on the site regardless.
Finally, you can also converse with Rufus regarding the status of your current orders, or recall when you last ordered (for instance) packing tape. Sometimes you will receive a direct answer, while other times, you’ll get a link to a relevant page on the Amazon site for more information. And if you’re curious about the origin of the name, it’s inspired by one of Amazon’s early employees.
Although e-commerce platforms like Amazon have simplified shopping, the vast array of options and categories can be confusing. With countless product choices, selecting the right one becomes challenging. Amazon has been incorporating AI into its shopping experience for some time, featuring summarized reviews and personalized product recommendations. To further enhance the shopping experience, Amazon has introduced an AI chatbot known as Rufus. Currently in beta, Rufus is available to select Android and iOS users in India.
According to Amazon, users can pose questions about specific products, such as “what to consider when purchasing a washing machine,” or “is a fitness band or smartwatch better for me?” and Rufus will address these queries and suggest products tailored to the user’s interests.
Rufus is an AI chatbot focused on shopping.
Amazon’s Rufus is clearly crafted and trained as a shopping-first AI chatbot, designed to browse the extensive catalog to respond to inquiries and recommend appropriate products.
In addition to addressing queries about specific items, if you’re interested in buying a smartphone, Rufus can help you filter products using specific criteria like battery life, display size, performance, storage, and more.
When we tasked Rufus with finding the most affordable foldable smartphones on the market, it pointed us to the OnePlus Open as one of the top options available. Although this phone is competitively priced among foldable devices, it’s important to note that models like the Samsung Galaxy Z Fold4 are currently priced just slightly lower than the OnePlus Open, making them viable alternatives for budget-conscious consumers.
However, we encountered some discrepancies in Rufus’s recommendations. For instance, when we inquired about smartphones known for their exceptional battery life, Rufus suggested several discontinued models, including the Asus Zenfone 9 and the Realme 9 Pro+ 5G. These outdated devices no longer represent current standards, which is misleading for anyone seeking to make a purchase based on battery performance.
In stark contrast, a direct search on Amazon yields much more relevant and up-to-date recommendations, indicating that the platform has a more effective approach for consumers looking for the latest technology. This experience reinforces the notion that traditional blogs and established tech websites remain invaluable resources for accurate product recommendations, especially within the consumer tech space. These sources often provide the most reliable information, helping buyers navigate the ever-evolving landscape of technology products.
Amazon has been extensively utilizing AI for over 25 years to enhance customer experiences. The personalized suggestions customers receive while shopping on Amazon, the optimized pick paths in our fulfillment centers, our drone delivery services, the conversational features of Alexa, and our checkout-free Amazon Go locations are just a few instances of experiences driven by AI. We believe that generative AI will transform nearly all customer experiences as we currently know them.
In the past year, we have launched several new capabilities in Amazon’s store powered by generative AI to simplify and enhance shopping. Our AI-generated review highlights allow customers to quickly grasp common themes from numerous reviews at a glance, helping them to understand customer insights rapidly.
We also recently rolled out our Fit Review Highlights feature, which provides tailored size guidance and insights so customers can determine which size will be the best fit for them. Additionally, we are leveraging generative AI to enhance product listings, assisting our selling partners in crafting more engaging and effective titles and product descriptions, while also enriching existing listings.
Rufus serves as a generative AI-powered expert shopping assistant, trained on Amazon’s vast product catalog, customer reviews, community Q&As, and online information to address customer queries regarding various shopping needs and products, offer comparisons, and suggest recommendations based on conversational context.
From broad inquiries at the beginning of a shopping journey, such as “what should I consider when purchasing running shoes?” to comparisons like “what distinguishes trail running shoes from road running shoes?” to specific questions such as “are these durable?”, Rufus significantly enhances how customers discover and find the best products that meet their requirements, seamlessly integrated into the familiar Amazon shopping experience.
We are introducing Rufus in beta and gradually rolling it out to customers, starting with a small group of users in the U.S. using our mobile app, and progressively expanding access to the rest of our U.S. customers in the upcoming weeks.
With Rufus, customers can:
Learn what factors to consider while shopping for product categories: Customers can carry out more general product research on Amazon, posing questions like “what should I consider when choosing headphones?”, “what should I keep in mind when detailing my car at home?”, or “what constitutes clean beauty products?” and receive useful information that guides their shopping journey.
Shop by occasion or need: Customers can search for and discover products tailored to activities, events, purposes, and other specific scenarios by asking various questions like “what do I need for golfing in cold weather?” or “I want to begin an indoor garden.” Rufus then suggests shoppable product categories—ranging from golf base layers, jackets, and gloves to seed starters, potting soil, and grow lights—and provides related questions that customers can click on for more specific searches.
Receive assistance in comparing product categories: Customers can ask “what is the difference between lip gloss and lip oil?” or “compare drip coffee makers and pour-over coffee makers” to find the product that best fits their needs, enabling them to make more informed purchasing choices.
Obtain top recommendations: Customers can inquire about specific recommendations like “what are good gifts for Valentine’s Day?” or “what are the best dinosaur toys for a 5-year-old?” Rufus produces results tailored to the exact question, making it quick and convenient for customers to browse more refined options.
Ask questions about a specific product while viewing its detail page: Customers can utilize Rufus to quickly obtain answers to specific inquiries about individual products when checking out the product’s detail page, such as “is this pickleball paddle suitable for beginners?”, or “is this jacket machine washable?”, or “is this cordless drill comfortable to hold?”. Rufus generates responses based on the listing details, customer reviews, and community Q&As.
With Rufus, customers now have the opportunity to shop alongside a generative AI-powered expert that thoroughly understands Amazon’s offerings, integrating information from the web to assist them in making better-informed purchase decisions.
Initiate the Rufus beta program
Rufus is now accessible to a select group of customers during their next update of the Amazon Shopping app. Those participating in the beta can easily start typing or voicing their queries in the search bar of Amazon’s mobile app, where a Rufus chat window will appear at the bottom of the screen. Users can expand this chat window to view answers, click on suggested inquiries, and ask follow-up questions directly in the chat window. At any point, customers can close Rufus to revert to conventional search results by swiping down and hiding the chat window.
Rufus provides responses by utilizing relevant data from Amazon and the web, aiding customers in making more informed shopping choices. Being in the early stages of generative AI, the technology may not always deliver accurate results. We will continuously refine our AI models and adjust responses over time to enhance Rufus’s usefulness. Customers are encouraged to share their thoughts by rating answers with a thumbs up or down, and have the option to give detailed feedback as well.
We are enthusiastic about the possibilities of generative AI and will keep testing new features to streamline the process of finding, discovering, researching, and purchasing products on Amazon. We anticipate gradually expanding Rufus availability to more U.S. customers in the weeks ahead.
Overview of the solution
At its foundation, Rufus operates with a large language model (LLM) that has been trained on Amazon’s product catalog along with various web information. Deploying LLMs can pose challenges, necessitating a balance among factors such as model size, accuracy, and performance during inference. Although larger models typically exhibit superior knowledge and reasoning abilities, they also incur higher costs due to increased compute requirements and greater latency. Rufus must be deployed and scaled effectively to handle the significant demand during events like Amazon Prime Day.
Considerations for this scalability include performance needs, environmental impact, and hosting costs. To address these challenges, Rufus utilized a mix of AWS services: Inferentia2 and Trainium, Amazon Elastic Container Service (Amazon ECS), and Application Load Balancer (ALB). Additionally, the Rufus team collaborated with NVIDIA to run the solution on NVIDIA’s Triton Inference Server, enabling the model to leverage AWS hardware.
Rufus’s inference operates as a Retrieval Augmented Generation (RAG) system, where responses are enhanced by sourcing additional data such as product details from Amazon search results. These results are tailored to the customer inquiry, ensuring that the LLM produces dependable, high-quality, and precise answers.
To prepare Rufus for Prime Day, the team developed a heterogeneous inference architecture utilizing multiple AWS Regions backed by Inferentia2 and Trainium. This multi-Regional approach provided two main advantages: it offered extra capacity during peak demand times and enhanced the overall resilience of the system.
The Rufus team could leverage both Inf2 and Trn1 instance types. Since both Inf2 and Trn1 instances operate with the same AWS Neuron SDK, the Rufus team was able to maintain service for the same Rufus model across both instance types. The only adjustment needed was the degree of tensor parallelism (24 for Inf2, 32 for Trn1). Utilizing Trn1 instances also resulted in a further 20% reduction in latency and an improvement in throughput compared to Inf2.
Enhancing inference performance and resource utilization
Within each Region, the Rufus inference architecture employed Amazon ECS to manage the foundational Inferentia and Trainium-powered instances. By overseeing the underlying infrastructure, the Rufus team only had to introduce their container and settings by defining an ECS task. Each container hosted an NVIDIA Triton Inference Server utilizing a Python backend, running vLLM along with the Neuron SDK. vLLM is an efficient memory inference and serving engine designed for high throughput. The Neuron SDK simplifies the adoption of AWS chips for teams and supports various libraries and frameworks, including PyTorch Lightning.
The Neuron SDK delivers an efficient LLM inference solution on Trainium and Inferentia hardware with optimized performance that accommodates a broad array of transformer-based LLM architectures. To minimize latency, Rufus collaborated with the AWS Annapurna team to explore several optimizations, including INT8 (weight only) quantization, continuous batching with vLLM, and enhancements in resource, compute, and memory bandwidth within the Neuron compiler and runtime. These optimizations have been deployed in Rufus’s production environment and are available for use starting from the Neuron SDK 2.18 version.
In order to minimize the total waiting time for customers to receive responses from Rufus, the team designed an inference streaming architecture. Given the significant computational and memory demands associated with LLM inference, the overall time required to complete a response for a customer query can span several seconds.
By implementing a streaming architecture, Rufus can deliver tokens immediately after they are generated. This enhancement enables customers to start accessing the response in under 1 second. Furthermore, various services collaborate through gRPC connections to smartly combine and improve the streaming response in real-time for users.
With this integration, Rufus achieved a vital optimization: continuous batching. Continuous batching allows a single host to significantly boost throughput. Additionally, continuous batching offers distinct advantages over other batching methods, such as static batching.
For instance, when utilizing static batching, the time to first token (TTFT) rises linearly as the number of requests in a batch increases. Continuous batching focuses on optimizing the prefill phase for LLM inference, helping to maintain TTFT at manageable levels even when handling numerous simultaneous requests. This capability enabled Rufus to deliver a favorable experience with low latency for the initial response while also enhancing single-host throughput to keep serving costs efficient.
How to Begin Using Amazon AI Chatbot Rufus?
To start using Amazon’s AI chatbot, Rufus, follow these straightforward steps:
- Update Your Amazon Shopping App: Ensure you have the most recent version of the Amazon Shopping app installed on your device.
- Find Rufus: Search for the Rufus icon (a bubble chat with sparkle) located in the navigation bar at the bottom of your display.
- Ask Questions: Enter or voice your shopping-related inquiries into the search bar. Rufus will respond with information based on product details, customer reviews, and community questions and answers.
- Explore Features: Utilize Rufus to gain insights into product information, receive recommendations, and compare different options.
Key Features of Amazon AI Chatbot Rufus
Amazon’s AI chatbot Rufus enhances your shopping journey by providing personalized suggestions, product comparisons, and order tracking.
Some features of AI Chatbot Rufus include:
- Product Information: Delivers comprehensive answers derived from product listings, customer reviews, and community Q&As.
- Comparison: Aids in comparing the attributes of various products, such as gas versus wood-fired pizza ovens.
- Trend Updates: Keeps shoppers updated on the latest products and trends.
- Order Tracking: Allows quick access to package tracking and previous orders.
- Personalized Recommendations: Recommends items tailored to specific customer requirements and preferences.
- Streamlining Product Discovery: Proposes relevant product categories and inquiries to assist shoppers in locating their needs.
Frequently Asked Questions
How many questions has Rufus replied to thus far?
Customers have posed tens of millions of questions to Rufus, and Amazon values their input.
Can I utilize Rufus for queries not related to shopping?
Rufus is mainly designed for shopping-related inquiries, but feel free to try asking anything—it may surprise you!
What technology underpins Rufus?
Rufus operates on generative AI to effectively comprehend and respond to customer inquiries.
Is Rufus available on desktop or exclusively on mobile?
Rufus is currently available only through the Amazon Shopping app on mobile devices.
Conclusion
Amazon’s Rufus AI chatbot represents a significant advancement in accessible technology, offering users a smooth way to interact with AI-driven assistance. As you begin your experience with Rufus, remember to explore its full capabilities by trying out various commands and features. Whether for shopping help, information gathering, or casual chatting, Rufus simplifies your online interactions.
Embrace this cutting-edge tool as it evolves and improves continuously, adjusting to your needs and enhancing your everyday life. With Amazon’s dedication to innovation, AI Chatbot Rufus aspires to be more than just a helpful assistant but a reliable partner in navigating the complexities of contemporary life. Start your journey with Rufus today and see how AI can transform your digital experiences like never before.