Tag: Axion

  • Google’s Billion-Dollar Pivot: AI-Assisted Migration to Arm Architecture

    Google has embarked on one of the most ambitious projects in its history — a full-scale migration of its internal infrastructure to the Arm architecture. According to the company, approximately 30,000 software packages have already been adapted, including key services such as YouTube, Gmail, and BigQuery. Ultimately, Google aims to complete the migration across all systems, enabling applications to run seamlessly on both x86 processors and the company’s new in-house Axion chips.

    The technical foundations of this initiative are outlined in a preprint titled “Instruction Set Migration at Warehouse Scale” and on Google’s official engineering blog. The publication’s authors — research engineer Parthasarathy Ranganathan and developer relations specialist Wolf Dobson — admit that at the outset, the team anticipated numerous challenges: discrepancies in floating-point computations, variations in multithreading behavior, and complications arising from platform-dependent operations and performance tuning. Yet, as practice has shown, modern compilers and testing frameworks have already resolved many of these issues.

    In the early stages, engineers manually ported Google’s largest internal services — F1, Spanner, and Bigtable. The greatest difficulty lay not in architectural incompatibilities but in the sheer labor of maintenance: fixing x86-dependent tests, modernizing outdated build and release systems, resolving deployment errors, and ensuring the stability of mission-critical services. In total, around 30,000 applications have been processed — an immense volume of code that prompted Google to employ its full suite of automation tools and develop a new AI-based assistant known as CogniPort.

    As the project’s authors explain, CogniPort analyzes build and testing errors. When a library, binary, or test fails to compile, the agent automatically attempts to correct the issue. In its Blueprint Editing mode, the system generates commits requiring complex, multi-layered modifications beyond simple code edits. Experimental data shows that CogniPort achieves an efficiency rate of around 30%, particularly in fixing test cases, platform-specific conditionals, and data representation errors.

    Although the current success rate remains modest, Google continues to scale up the process — with roughly 70,000 additional packages yet to be adapted. The ultimate goal is to achieve complete cross-architecture compatibility, enabling the Borg orchestration system — the internal precursor to Kubernetes — to distribute workloads between servers with maximum efficiency.

    The transition carries significant economic implications as well. Google estimates that servers powered by Axion processors deliver up to 65% better performance-per-dollar compared to x86 platforms and approximately 60% higher energy efficiency. These gains promise to reduce infrastructure costs and lower the overall energy consumption of Google’s vast global data center network.

  • Axion Unveiled: Google’s Arm-Based Powerhouse

    According to Wccftech, Google unveiled its inaugural in-house developed chip based on the Arm architecture, officially named Axion, at the 2024 Cloud Next conference. This chip, specifically designed for data centers, is touted by Google to offer a plethora of performance and efficiency benefits.

    Google introduced Axion as a brand-new TPU v5p tensor processor, boasting double the raw performance of the preceding TPU v4, set to operate within a TPU cluster comprising 8960 chips. TPUs, or Tensor Processing Units, are processors engineered by Google to tackle computational problems in machine learning, particularly in deep learning. Axion is capable of delivering 50% higher performance than the current virtual machines powered by x86 processors, while consuming 60% less power, and it operates 30% faster than the quickest Arm architecture chips presently available in the cloud. Google has already integrated Axion across various services, including its YouTube advertisements on Google Cloud.

    Mark Lohmeyer, Vice President of Google Cloud, highlighted the advantages of Axion for customers, stating, “We are facilitating our clients’ transition of their existing workload to Arm with ease. Axion is built on an open foundation, allowing any client using Arm to seamlessly adopt Axion without the need for redesigning or rewriting their applications.”

    Google also disclosed plans to expand the application of Axion and will unveil its public usage plans in the future, although specific products or customers eligible for access remain undisclosed at this time. The launch of Axion also affirms Google’s capability for independent chip development, lending credence to rumors about the entirely in-house developed Tensor G5 chip potentially debuting in the Pixel 10 series, expected to utilize TSMC’s 3nm process technology, advancing beyond the Tensor G4/G3.