Business, Freemium

Vertex AI

Unified, fully-managed Google Cloud platform for building, training, deploying, and monitoring ML and GenAI models.
Rating: 8.8
Price: Freemium
Key Features: 6

Overview

Vertex AI is Google Cloud’s end-to-end, fully managed machine learning and generative AI platform. It unifies tools for model discovery (Model Garden), training, fine-tuning, evaluation, deployment (online/batch prediction), MLOps (pipelines, model registry, monitoring), and low-code/no-code agent creation (Agent Builder). Vertex AI provides first-party and third-party models (including Gemini and Imagen), supports custom training with a wide range of VM and accelerator types, integrates with BigQuery, Cloud Storage, and Notebooks, and offers vector search, feature store, and evaluation services to support production ML workflows and enterprise use cases.

Details

Developer: cloud.google.com
Launch Year: 2021
Free Trial: Yes
Updated: 2025-12-07

Features

Model Garden

Discover, test, and deploy first- and third-party models including Gemini and Imagen; supports model evaluation and fine-tuning.

Vertex AI Studio

Web UI for prototyping, testing, and deploying multimodal models and for interacting with Gemini-style models.

Agent Builder / Agent Engine

Low-code/no-code tools for building conversational agents, plus a managed runtime for agent execution that includes free-tier quotas.

MLOps Suite

Pipelines, Model Registry, Feature Store, Model Monitoring, and Evaluation tools to orchestrate and monitor model lifecycles.

Training & Inference Infrastructure

Flexible training and prediction options across many VM families, GPUs, TPUs, Ray clusters, and NAS; supports online and batch prediction.

Vector Search & Feature Store

Built-in vector search (indexing/storage/queries) and feature storage for production ML feature serving.

Pricing

AutoML Image - Training (Classification)
Free

Image classification training billed per training hour.

  • Training: US$3.465 per hour (classification)
  • Predefined machine configs used
  • Pay only for actual compute hours
AutoML Image - Training (Object Detection)
Free

Object detection training billed per training hour.

  • Training: US$3.465 per hour (object detection)
  • Predefined machine configs used
  • Pay only for actual compute hours
AutoML Image - Edge Training
Free

Edge-device model training billed per hour.

  • Edge-device training: US$18.00 per hour
  • Optimized for on-device models
  • Billed per training hour
AutoML Image - Deployment & Online Prediction (Classification)
Free

Deployed model hourly price for online predictions.

  • Deployment & online prediction: US$1.375 per hour (classification)
  • Pay per endpoint node hour
  • Must deploy model to serve online predictions
AutoML Image - Deployment & Online Prediction (Object Detection)
Free

Deployed object detection model hourly price for online predictions.

  • Deployment & online prediction: US$2.002 per hour (object detection)
  • Pay per endpoint node hour
  • Must deploy model to serve online predictions
AutoML Image - Batch Prediction
Free

Batch prediction billed per node hour for image models.

  • Batch prediction: US$2.222 per hour
  • Billed per prediction node hour
  • Used for large offline prediction jobs
AutoML Tables - Training (per node hour)
Free

AutoML Tables training charged per node-hour used.

  • Training: US$21.252 per node hour
  • Predefined machine node pricing
  • Pay only for actual node hours
AutoML Tables - Prediction (batch infrastructure note)
Free

Batch prediction uses 40 n1-highmem-8 machines (infrastructure note).

  • Batch prediction uses 40 n1-highmem-8 machines
  • Prediction cost based on used machines and hours
  • No single fixed per-node price listed
Vertex AI Forecast - AutoML Prediction (tiered monthly)
Free

Tiered monthly pricing per 1,000 predicted counts.

  • 0–1,000,000 count: US$0.20 per 1,000 count per month
  • 1,000,000–50,000,000 count: US$0.10 per 1,000 count per month
  • 50,000,000+ count: US$0.02 per 1,000 count per month
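As a rough illustration of how the tiers above might add up, here is a minimal Python sketch. It assumes the tiers are marginal, meaning each rate applies only to the volume that falls inside its band; the listing does not state this, so treat the result as an estimate. The function name and the example volume are hypothetical.

```python
# Tier upper bounds and rates copied from the list above (USD per 1,000 count).
# ASSUMPTION: tiers are marginal, so each rate applies only to volume in its band.
TIERS = [
    (1_000_000, 0.20),
    (50_000_000, 0.10),
    (float("inf"), 0.02),
]


def forecast_prediction_cost(count: int) -> float:
    """Estimate the monthly cost in USD for `count` forecast predictions."""
    cost = 0.0
    lower = 0
    for upper, rate_per_1000 in TIERS:
        if count <= lower:
            break
        in_band = min(count, upper) - lower  # volume that falls in this band
        cost += in_band / 1000 * rate_per_1000
        lower = upper
    return cost


if __name__ == "__main__":
    # Hypothetical volume: 2.5 million predictions in one month.
    print(round(forecast_prediction_cost(2_500_000), 2))
```

If billing instead applies a single rate to the whole volume once a threshold is crossed, the loop would need to be replaced by a lookup of the matching tier.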
Vertex AI Forecast - AutoML Training
Free

Forecast AutoML training charged per training hour.

  • Training: US$21.252 per hour
  • Billed per training node hour
  • Pay only for actual training compute time
Vertex AI Forecast - ARIMA+ Prediction
Free

ARIMA+ predictions charged per 1,000 counts.

  • Prediction: US$5.00 per 1,000 count
  • Training: (not specified in table)
  • Time-series specific pricing
Gen AI Evaluation Service - Automatic (compute) metrics
Free

Compute-based metrics billed per 1,000 characters input/output.

  • Input: US$0.00003 per 1,000 input characters
  • Output: US$0.00009 per 1,000 output characters
  • Uses default auto scorer model Gemini 2.0 Flash
Gen AI Evaluation Service - Legacy-model metrics
Free

Older-model metrics charged per 1,000 characters.

  • Input: US$0.005 per 1,000 input characters
  • Output: US$0.015 per 1,000 output characters
  • Used for metrics based on older evaluation models
Agent Engine - Free tier (monthly)
Free

Monthly free quota for vCPU and RAM for Agent Engine runtime.

  • vCPU: first 180,000 vCPU-seconds free per month
  • RAM: first 360,000 GiB-seconds free per month
  • Helpful for trial and small workloads
Agent Engine - vCPU usage (paid)
Free

vCPU billed per 3,600-second block above free tier.

  • First 50 vCPU-hours per month: free
  • Above 50 vCPU-hours: US$0.0994 per 3,600 vCPU-seconds (US$0.0994 per vCPU-hour)
  • Billed per project monthly
Agent Engine - RAM usage (paid)
Free

RAM billed per 3,600 gibibyte-seconds above free tier.

  • First 100 GiB-hours per month: free
  • Above 100 GiB-hours: US$0.0105 per 3,600 GiB-seconds (US$0.0105 per GiB-hour)
  • Billed per project monthly
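The Agent Engine free tier and the paid rates can be combined into a single monthly estimate. The sketch below is one way to do that arithmetic in Python, using only the figures listed above. The function name and the example workload are hypothetical, and it assumes usage is summed per project before the free allowance is subtracted.

```python
FREE_VCPU_SECONDS = 180_000     # free vCPU-seconds per month (listed above)
FREE_GIB_SECONDS = 360_000      # free GiB-seconds of RAM per month (listed above)
VCPU_RATE_PER_HOUR = 0.0994     # USD per 3,600 vCPU-seconds
RAM_RATE_PER_GIB_HOUR = 0.0105  # USD per 3,600 GiB-seconds


def agent_engine_monthly_cost(vcpu_seconds: float, gib_seconds: float) -> float:
    """Estimate the monthly Agent Engine runtime cost in USD.

    ASSUMPTION: the free allowance is subtracted once from the project's
    total monthly usage, and only the remainder is billed.
    """
    billable_vcpu = max(0.0, vcpu_seconds - FREE_VCPU_SECONDS)
    billable_ram = max(0.0, gib_seconds - FREE_GIB_SECONDS)
    return (billable_vcpu / 3600 * VCPU_RATE_PER_HOUR
            + billable_ram / 3600 * RAM_RATE_PER_GIB_HOUR)


if __name__ == "__main__":
    # Hypothetical workload: 1 vCPU and 2 GiB of RAM running for 200 hours.
    hours = 200
    estimate = agent_engine_monthly_cost(hours * 3600 * 1, hours * 3600 * 2)
    print(round(estimate, 2))
```

Note that 180,000 vCPU-seconds is 50 vCPU-hours and 360,000 GiB-seconds is 100 GiB-hours, which matches the free thresholds in the two paid entries.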
Custom Training - Example machine: n1-standard-4
Free

Custom training VM price per hour for n1-standard-4.

  • n1-standard-4: US$0.21849885 per hour
  • Use Compute Engine machine types
  • Accelerators charged separately if attached
Custom Training - Example machine: n1-standard-64
Free

Custom training VM price per hour for n1-standard-64.

  • n1-standard-64: US$3.4959816 per hour
  • High vCPU/memory configuration
  • Pay per VM hour
Custom Training - GPU-enabled A2 example: a2-highgpu-8g
Free

A2 GPU VM price per hour (includes required GPUs).

  • a2-highgpu-8g: US$35.401991315 per hour
  • Price includes fixed GPU cost for that instance type
  • Accelerator pricing embedded in instance
Accelerators (training) - NVIDIA_TESLA_A100
Free

GPU accelerator price per hour (A100) plus management fee.

  • NVIDIA_TESLA_A100: US$2.933908 per hour
  • Vertex management fee: US$0.4400862 per hour
  • Combine with Compute Engine VM pricing
Accelerators (training) - NVIDIA_H100_80GB
Free

H100 80GB accelerator price per hour with management fee.

  • NVIDIA_H100_80GB: US$9.79655057 per hour
  • Vertex management fee: US$1.4694826 per hour
  • High-end GPU for training
Disk - pd-standard
Free

Standard persistent disk billed per GiB-hour.

  • pd-standard: US$0.000063014 per GiB-hour
  • Billed hourly by provisioned GiB
  • First 100 GiB per VM typically free (context-dependent)
Disk - pd-ssd
Free

SSD persistent disk billed per GiB-hour.

  • pd-ssd: US$0.000267808 per GiB-hour
  • Higher performance than pd-standard
  • Billed hourly by provisioned GiB
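Because disk is billed per provisioned GiB-hour, the per-unit rates look tiny but scale with both size and duration. A minimal Python sketch of that calculation follows. The function name and the example disk size are hypothetical, and the sketch ignores any free allowance.

```python
# Rates copied from the two disk entries above (USD per GiB-hour).
DISK_RATES = {
    "pd-standard": 0.000063014,
    "pd-ssd": 0.000267808,
}


def disk_cost(disk_type: str, provisioned_gib: float, hours: float) -> float:
    """Estimate disk cost in USD for a provisioned size held for `hours`.

    ASSUMPTION: billing is on provisioned (not used) capacity with no free
    allowance applied.
    """
    return DISK_RATES[disk_type] * provisioned_gib * hours


if __name__ == "__main__":
    # Hypothetical job: a 500 GiB disk attached for 24 hours, both disk types.
    for disk_type in DISK_RATES:
        print(disk_type, round(disk_cost(disk_type, 500, 24), 4))
```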
Prediction (Inference) - E2 series example e2-standard-4
Free

Online/batch prediction node hour price for e2-standard-4.

  • e2-standard-4: US$0.1541128 per hour
  • Billed per node hour for online or batch predictions
  • Spot and reserved options supported
Prediction (Inference) - N1 series example n1-standard-4
Free

N1 prediction node hour price for n1-standard-4.

  • n1-standard-4: US$0.219 per hour
  • Used for online/batch prediction node pricing
  • Pay per node hour
Prediction (Inference) - N2 series example n2-standard-4
Free

N2 prediction node hour price for n2-standard-4.

  • n2-standard-4: US$0.2233708 per hour
  • Used for online/batch prediction node pricing
  • Pay per node hour
Prediction (Inference) - A2 GPU example a2-highgpu-1g
Free

A2 GPU prediction node hourly price (includes GPU cost).

  • a2-highgpu-1g: US$4.2244949 per hour
  • Instance includes fixed GPU cost
  • Suitable for GPU-accelerated predictions
Prediction - Optional GPU accelerators (per GPU hour)
Free

Optional GPUs for prediction charged per GPU-hour.

  • NVIDIA_TESLA_P4: US$0.69 per hour
  • NVIDIA_TESLA_P100: US$1.679 per hour
  • NVIDIA_TESLA_T4: US$0.402 per hour
  • NVIDIA_TESLA_V100: US$2.852 per hour
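Optional GPUs are charged on top of the prediction node's machine rate, so the hourly cost of an endpoint is the sum of both. The Python sketch below illustrates this with the n1-standard-4 prediction rate and the GPU rates listed above. The function name, node count, and hours are hypothetical, and the sketch assumes every node carries the same number of GPUs. The V100 entry is keyed as NVIDIA_TESLA_V100 here to match the naming of the other GPUs.

```python
N1_STANDARD_4_RATE = 0.219  # USD per node hour for prediction (listed above)

# Optional GPU rates copied from the list above (USD per GPU-hour).
GPU_RATES = {
    "NVIDIA_TESLA_P4": 0.69,
    "NVIDIA_TESLA_P100": 1.679,
    "NVIDIA_TESLA_T4": 0.402,
    "NVIDIA_TESLA_V100": 2.852,
}


def endpoint_cost(hours, nodes=1, gpu_type=None, gpus_per_node=0):
    """Estimate the cost in USD of n1-standard-4 prediction nodes.

    ASSUMPTION: cost per node hour = machine rate + GPUs per node * GPU rate,
    and every node has the same GPU configuration.
    """
    hourly = N1_STANDARD_4_RATE
    if gpu_type is not None:
        hourly += GPU_RATES[gpu_type] * gpus_per_node
    return hours * nodes * hourly


if __name__ == "__main__":
    # Hypothetical endpoint: 2 nodes with one T4 each, running for 100 hours.
    print(round(endpoint_cost(100, nodes=2, gpu_type="NVIDIA_TESLA_T4",
                              gpus_per_node=1), 2))
```

Not every machine type supports every GPU, so check compatibility before pairing rates from different entries.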
TPU v5e - example ct5lp-hightpu-1t
Free

TPU v5e pricing per TPU type per hour.

  • ct5lp-hightpu-1t: US$1.38 per hour
  • ct5lp-hightpu-4t: US$5.52 per hour
  • ct5lp-hightpu-8t: US$5.52 per hour (as listed)
Ray on Vertex AI - Example VM n1-standard-4
Free

Ray on Vertex AI training VM pricing per hour (example).

  • n1-standard-4: US$0.2279988 per hour (Ray on Vertex)
  • Accelerators charged separately
  • Billed per VM hour for Ray clusters
Neural Architecture Search - Example VM n1-standard-4
Free

NAS hourly machine price example for n1-standard-4.

  • n1-standard-4: US$0.2849985 per hour (NAS)
  • Predefined and custom configs supported
  • Billed per VM hour for NAS jobs
Neural Architecture Search - Accelerator (A100)
Free

NAS accelerator A100 hourly price (example).

  • NVIDIA_TESLA_A100: US$4.400862 per hour (NAS listed GPU pricing)
  • Attachable to NAS VMs
  • Billed per accelerator hour

Pros & Cons

Pros

  • Fully managed end-to-end ML platform (training, tuning, deployment, monitoring).
  • Native access to Google’s first-party models (Gemini, Imagen) via Model Garden.
  • Rich MLOps features: Pipelines, Model Registry, Monitoring, Evaluation, Feature Store.
  • Broad infrastructure choices (many VM types, GPUs, TPUs, Ray integration).
  • Generous free tiers and integration with Google Cloud $300 trial credits.

Cons

  • Usage-based pricing can be complex to predict for large, variable workloads.
  • Many separate SKUs (compute, accelerators, disk, agent runtime) complicate billing.
  • Some documentation pages (for example, the features overview) may return 404 errors or vary by locale.
  • Enterprise features and committed-use discounts require contacting sales.

Compare with Alternatives

Feature                | Vertex AI | Google Gemini | Together AI
Pricing                | N/A       | N/A           | N/A
Rating                 | 8.8/10    | 9.0/10        | 8.4/10
Model Ecosystem        | Yes       | Yes           | Yes
Multimodal Support     | Yes       | Yes           | No
Agent Builder          | Yes       | Partial       | No
Fine-tune Control      | Yes       | Partial       | Yes
Deployment Flexibility | Yes       | Partial       | Yes
MLOps Tooling          | Yes       | Partial       | Partial
Inference Scalability  | Yes       | Partial       | Yes
Vector Search          | Yes       | Partial       | Partial

Audience

Developers: Build, prototype, and deploy apps using managed ML models and Vertex AI Studio.
ML Engineers: Train, tune, and deploy scalable models with MLOps pipelines and monitoring.
Data Scientists: Experiment with AutoML, custom models, and Model Garden models for insights.
Enterprises: Operationalize ML workflows with enterprise-grade monitoring, governance, and support.

Tags

ai, machine-learning, mlops, gen-ai, multimodal, model-deployment, google-cloud, vector-search
