Microsoft has revamped Bing’s search technology by integrating advanced language models, promising cost savings alongside quicker and more accurate search outcomes.

The latest updates feature a combination of large language models (LLMs), small language models (SLMs), and cutting-edge optimization strategies aimed at refining search performance.

Advancements in Search Technology

Microsoft revealed these enhancements in a recent announcement, emphasizing its commitment to improving search. According to the company:

“At Bing, innovation drives our approach to search. By utilizing both Large Language Models (LLMs) and Small Language Models (SLMs), we’ve reached a pivotal moment in boosting search efficiency. While transformer models have been effective, evolving user demands require even more capable systems.”

Balancing Performance and Efficiency

Deploying LLMs in production often raises challenges of speed and operating cost. To address this, Bing trained SLMs, which it says deliver roughly 100 times the throughput of its LLMs. The company elaborated:

“LLMs are resource-intensive and slow to execute. To enhance performance, we’ve trained SLMs, achieving approximately 100x throughput improvement over LLMs, resulting in faster and more precise query processing.”

Additionally, Bing employs NVIDIA’s TensorRT-LLM technology to optimize the performance of these models. TensorRT-LLM enables faster and more cost-effective execution of large models on NVIDIA GPUs.

Enhancing Deep Search

Microsoft’s technical documentation highlights how integrating TensorRT-LLM has improved Bing’s “Deep Search” feature, which uses SLMs to deliver relevant results in real time.

Previously, Bing’s transformer model exhibited a latency of 4.76 seconds per batch (20 queries) and a throughput of 4.2 queries per second per instance. With TensorRT-LLM, latency has dropped to 3.03 seconds per batch, while throughput increased to 6.6 queries per second per instance. This amounts to a 36% reduction in latency and a 57% increase in per-instance throughput, which is where the gain in operational efficiency comes from.
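The percentages quoted above can be checked directly from the reported figures. The short Python sketch below is illustrative only: the helper names `percent_change` and `implied_throughput` are made up for this example, and the idea that throughput equals batch size divided by batch latency is an assumption (one batch in flight per instance), not something Microsoft states.

```python
# Figures as reported in the article.
BATCH_SIZE = 20                      # queries per batch
latency_before_s = 4.76              # seconds per batch, before TensorRT-LLM
latency_after_s = 3.03               # seconds per batch, with TensorRT-LLM
throughput_before_qps = 4.2          # queries per second per instance, before
throughput_after_qps = 6.6           # queries per second per instance, after


def percent_change(before: float, after: float) -> float:
    """Relative change from `before` to `after`, in percent."""
    return (after - before) / before * 100.0


def implied_throughput(batch_size: int, batch_latency_s: float) -> float:
    """Queries per second if one batch is processed at a time (assumption)."""
    return batch_size / batch_latency_s


latency_change = percent_change(latency_before_s, latency_after_s)
throughput_change = percent_change(throughput_before_qps, throughput_after_qps)

print(f"Latency change:    {latency_change:+.1f}%")
print(f"Throughput change: {throughput_change:+.1f}%")
print(f"Implied throughput before: {implied_throughput(BATCH_SIZE, latency_before_s):.2f} qps")
print(f"Implied throughput after:  {implied_throughput(BATCH_SIZE, latency_after_s):.2f} qps")
```

Under that assumption, the implied throughput values land very close to the reported 4.2 and 6.6 queries per second, which suggests the latency and throughput numbers describe the same measurement from two angles.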

Microsoft affirmed:

“Our mission is to deliver the best search experience without compromising quality. TensorRT-LLM allows us to reduce inference time, enhancing overall response speed while maintaining top-notch results.”

Benefits for Users

These advancements offer several advantages for Bing users:

  • Faster and more responsive search results
  • Enhanced accuracy through SLMs, ensuring better context in answers
  • Improved cost-effectiveness, enabling future innovations and upgrades

The Significance of Bing’s Strategy

By combining LLMs, SLMs, and TensorRT-LLM optimization, Bing is positioning itself as a leader in handling increasingly complex user queries.

As search engines strive to meet growing expectations, Bing’s use of smaller, highly optimized models demonstrates how modern search can balance speed, precision, and efficiency.

31.12.2024

Bing Search Revamped: Faster and Smarter Results
