What is Per-Second Billing | FinOps Glossary

How It Works

Most cloud providers historically billed compute time in full-minute or full-hour blocks. Even if a workload ran for 61 seconds, customers paid for two full minutes. Per-second billing eliminates that rounding tax. The clock starts when a function or instance begins execution and stops the moment it completes. AWS Lambda and AWS Fargate both use per-second billing, with a short minimum charge applied to very brief executions. GCP charges Compute Engine VMs by the second after a one-minute minimum. Azure applies per-second billing to several compute services as well.

Why It Matters for Cloud Cost

For teams running short-lived or variable workloads, per-second billing directly reduces waste. Serverless functions that process events in under a minute, batch jobs that spin up and down frequently, and container tasks with unpredictable runtimes all benefit from not paying for unused time. At scale, the savings from eliminating billing rounding can be meaningful, particularly for high-frequency Lambda invocations or Fargate tasks. Teams that ignore this model often overprovision to avoid partial-minute penalties, which compounds the waste further.

Usage AI’s Flex Savings Plan covers EC2, Fargate, and Lambda, saving 40 to 60% versus on-demand with no upfront cost and a full buyback guarantee on any underutilization.

See how Usage AI saves 30 to 50% on AWS, GCP, and Azure.

Per-Second Billing

How It Works

Why It Matters for Cloud Cost

Provisioned IOPS

Private Pricing Agreement

PaaS (Platform as a Service)