顯示包含「Technology」標籤的文章。顯示所有文章
顯示包含「Technology」標籤的文章。顯示所有文章

2026年1月5日星期一

[Technology] Nvidia’s Next‑Gen AI Chips Are Now in Full Production

 

Nvidia’s Next‑Gen AI Chips Are Now in Full Production

  • CEO Jensen Huang announced at CES 2025 that Nvidia’s upcoming chip generation—the Rubin platform—is fully in production and already being tested by AI firms.

  • These chips can deliver 5× the AI computing performance of the previous generation when running chatbots and similar applications.

What the Rubin Platform Includes

  • Rubin consists of six different Nvidia chips.

  • The flagship configuration includes:

    • 72 GPU units

    • 36 new CPU units

  • Systems can be scaled into “pods” containing 1,000+ Rubin chips.

How Nvidia Achieved the Performance Jump

  • Huang said the gains come from a new proprietary data format Nvidia hopes the industry will adopt.

  • Despite only a 1.6× increase in transistor count, performance jumps dramatically due to this new data approach.

Competition Is Intensifying

  • Nvidia still dominates AI model training, but faces growing competition from:

    • AMD

    • Google (a customer and a rival)

    • Meta and others collaborating on alternative AI chips.

New Technologies Introduced

  • Context Memory Storage: Helps chatbots respond faster during long conversations at massive scale.

  • Co‑packaged optics networking switches: Compete with Broadcom and Cisco for linking thousands of machines efficiently.

Self‑Driving Car Software

  • Nvidia is releasing Alpamayo, a decision‑making system for autonomous vehicles.

  • Both the models and the training data will be open‑sourced to improve transparency and trust.

Strategic Moves

  • Nvidia recently acquired talent and chip tech from Groq, strengthening its AI hardware capabilities.

  • The company is also positioning Rubin to outperform older chips like the H200, which the U.S. government now allows to be sold to China.

2025年11月25日星期二

[Technology] How popular is Gemini 3?

Gemini 3 is currently very popular, with significant user engagement and performance metrics. It has 2 billion monthly users for AI Overviews and 650 million users for the Gemini app. Gemini 3 Pro has topped the LMArena leaderboard, showcasing its advanced capabilities. The model is integrated into Google Search and developer tools, indicating widespread adoption and usage. It has been recognized as Google's most intelligent model, with improvements in reasoning and multimodal understanding. Overall, Gemini 3 is positioned as a leading AI model, reflecting its growing popularity and effectiveness in various applications.


Individual Comments on Gemini 3

Marc Benioff (Salesforce CEO):
After years of daily ChatGPT use, he said Gemini 3 was a game‑changer: “Holy shit … I’m not going back. The leap is insane — reasoning, speed, images, video… everything is sharper and faster.”

His endorsement signals a dramatic shift in enterprise perception.

Wei‑Lin Chiang (CTO, LMArena):
Gemini 3 Pro holds a “clear lead” in coding, math, and creative writing.
Surpasses Claude 4.5 and GPT‑5.1 in agentic coding and visual comprehension.

Alex Conway (DataRobot engineer):
Highlighted Gemini’s performance on ARC‑AGI‑2 reasoning benchmark, scoring nearly twice as high as GPT‑5 Pro at one‑tenth the cost.

Also doubled GPT‑5.1’s score on SimpleQA, making it strong for niche knowledge.

Tim Dettmers (Carnegie Mellon):
Called it a “great model” but noted UX issues: doesn’t always follow instructions precisely.

Joel Hron (CTO, Thomson Reuters):
Found Gemini 3 strong in legal/tax reasoning tasks, outperforming Gemini 2.5 and some Anthropic/OpenAI models.

Louis Blankemeier (CEO, Cognita):
Excited by numbers but cautious: Gemini struggled with subtle radiology cases (rib fractures, rare conditions).
Compared radiology challenges to self‑driving cars — edge cases remain tough.

Matt Hoffman (Head of AI, Longeye):
Praised Gemini’s image generator for synthetic datasets but said benchmarks don’t map neatly to law enforcement use cases.

Thomas Schlegel (VP Engineering, Built):
Sees Gemini 3 as “everything we love about Gemini on steroids”.
Still plans to use a mix of models (Claude for coding, OpenAI for business reasoning).

Tanmai Gopal (CEO, PromptQL):
Acknowledged Gemini’s leap but said it’s “not the end of anything” for competitors.
Prefers Claude for code, ChatGPT for search, GPT‑5 Pro for brainstorming, but may adopt Gemini for consumer tasks.

Andrej Karpathy (AI researcher):
Positive early impression: “tier 1 LLM” with strong personality, humor, and vibe coding.
Noted quirks like refusing to accept the year 2025 or forgetting to turn on Google Search.




Reference