Skip to content
STAGING — not production
Reference

Glossary

Trading infrastructure and low-latency systems, defined by a practitioner.

61 terms

A

AF_XDP

A Linux socket address family that enables zero-copy packet processing by mapping NIC receive queues directly into user-space memory, bypassing the kernel network stack entirely. Unlike DPDK, AF_XDP works with standard kernel drivers and retains Linux tooling compatibility while still achieving sub-microsecond per-packet latency on modern NICs.

AF_XDP Kernel Bypass
Arbitrage

The simultaneous purchase and sale of an asset across two or more venues to profit from a price discrepancy. In practice, pure arbitrage windows collapse in microseconds on efficient venues; durable edge comes from structural information advantages — co-located feeds, superior order routing, or proprietary liquidity access — not just price observation.

Attestation (TEE)

A cryptographic proof that a specific binary is running inside a genuine hardware enclave, generated by the CPU itself and verifiable by any third party without trusting the host operator. AWS Nitro enclaves produce signed attestation documents containing a PCR (Platform Configuration Register) measurement that uniquely identifies the exact code being executed.

Trusted Execution Environments

B

Block Builder

A specialized actor in Ethereum's post-Merge block production pipeline that assembles the most profitable ordered set of transactions into a block and bids for it via MEV-Boost relays. Builders compete on simulation speed and MEV extraction sophistication — the best builders run thousands of transaction simulations per second to find the highest-value ordering before the 12-second slot deadline.

SUAVE: Decentralized Block Building
Busy Polling

A latency reduction technique where the application continuously spins in a tight loop checking for new data rather than sleeping and waiting for an interrupt. Interrupts introduce jitter of 5–50µs because they context-switch the CPU; busy polling eliminates that at the cost of one dedicated CPU core running at 100% utilization.

CPU Optimization for Linux Latency
BPF / eBPF

Extended Berkeley Packet Filter is a kernel subsystem that lets you run sandboxed programs inside the Linux kernel at JIT-compiled native speed without modifying kernel source or loading kernel modules. In trading infrastructure it powers zero-overhead packet filtering, latency tracing via kprobes, and XDP fast-path forwarding — all without the instability risk of a kernel patch.

The Observer Effect: eBPF Profiling

C

C-states

CPU power-saving sleep levels ranging from C0 (active) to C6+ (deep sleep). Deeper C-states save more power but impose wake-up penalties: C1 costs ~2µs, C3 costs ~50µs, C6 costs 100µs or more. A trading thread waiting for a packet in C3 pays a 50µs tax before it can even read the data — effectively adding half a microsecond to every fill.

CPU Optimization for Linux Latency
Circuit Breaker

A risk control mechanism that halts trading automatically when a predefined loss threshold, error rate, or anomalous condition is breached, preventing a runaway algorithm from catastrophic exposure. Unlike a simple limit check, a well-designed circuit breaker counts consecutive failures and transitions through open/half-open/closed states so the system can recover gracefully rather than requiring manual restart.

Trading Risk Management
Colocation

Housing your trading servers inside the same data center — often the same rack row — as an exchange's matching engine, shrinking the network round-trip from hundreds of milliseconds (remote cloud) to under a microsecond (cross-connect cable). Top-tier colo space within 100 meters of a matching engine costs $50k–$150k/month and is the single highest-ROI infrastructure investment a professional HFT firm makes.

Trading Colocation Explained
CPU Isolation (isolcpus)

A Linux boot parameter that instructs the kernel scheduler to exclude specified cores from its normal work queue, reserving them exclusively for processes pinned there with taskset or pthread_setaffinity_np. Paired with nohz_full (which disables the per-CPU timer tick) and rcu_nocbs (which offloads RCU callbacks), isolation can reduce P99 jitter from 50µs to under 5µs on a dedicated trading core.

CPU Optimization for Linux Latency
Cold Storage

Private key material that has never been exposed to a networked device and is held on air-gapped hardware (HSMs, paper, or offline signing devices). In custodial trading infrastructure, cold storage governs settlement-layer assets while a hot wallet with strict position limits handles intraday liquidity — the split is a deliberate blast-radius containment design.

Wallet Types Explained

D

DPDK

Data Plane Development Kit is a set of libraries and drivers that moves all packet I/O from the Linux kernel into user space via polling, replacing interrupt-driven processing. DPDK achieves line-rate forwarding at 14 million packets per second on a single core by eliminating context switches, memory copies, and kernel syscalls — standard in production HFT feed handlers and co-located gateways.

Linux Networking Stack
Deterministic Execution

A system property where the same inputs always produce the same outputs in the same time, with no variance introduced by garbage collection, OS scheduling jitter, or non-deterministic memory allocation. TEE-based trading enclaves and real-time kernel builds prioritize determinism over throughput because unpredictable latency spikes — not average latency — cause missed fills and unexpected slippage.

Trusted Execution Environments

E

ECDSA

Elliptic Curve Digital Signature Algorithm is the signing scheme used by Bitcoin, Ethereum, and most modern blockchains to prove transaction authorization. The signing operation itself takes roughly 70–200µs in software; in latency-sensitive custody systems, HSMs with hardware ECDSA acceleration bring this below 10µs while keeping private keys in tamper-proof storage.

Cryptography Basics
Edge Computing

Processing data at or near the point of origin rather than routing it to a central cloud region, reducing network round-trip time and eliminating the ~20–100ms tax of traversing a continent. For DeFi trading systems this means running validation and signing logic on a server co-located with the RPC node, not in a data center on the other side of the Atlantic.

Enclave (TEE)

A hardware-isolated execution environment where code and data are encrypted in memory and inaccessible to the host OS, hypervisor, or even the cloud provider's engineers. AWS Nitro Enclaves, Intel SGX, and AMD SEV each implement this isolation differently, but all provide the same guarantee: a remote party can verify exactly what binary is running inside before trusting it with key material.

Nitro Enclave: Sovereign Pod
EVM

The Ethereum Virtual Machine is the sandboxed runtime that executes smart contract bytecode on every full node in the Ethereum network, deterministically computing the same state transition given the same inputs. EVM execution speed caps out around 15–30M gas per block (roughly 1,000–3,000 complex operations per second), which is why high-frequency on-chain logic must be aggressively gas-optimized or moved off-chain with only settlement on-chain.

Ethereum Basics

F

FPGA

A Field-Programmable Gate Array is a reconfigurable silicon chip where logic is implemented directly in hardware rather than as software running on a CPU. Top-tier HFT firms implement feed parsing, order encoding, and market-data normalization in FPGA, achieving 50–200ns round-trips that are physically impossible for any CPU-based software system because there is no instruction fetch cycle.

FPGAs in Trading Infrastructure
Fill Latency

The elapsed time from when your order is accepted by the exchange to when an execution report confirming the fill arrives back at your system — a critical metric because it determines how stale your position knowledge is during fast markets. Measuring fill latency accurately requires hardware timestamps at both the outbound order and the inbound execution report, not wall-clock time in application code.

What Is Latency?
Flash Loan

An uncollateralized loan that must be borrowed and repaid within the same transaction block, enforced by a smart contract that reverts the entire transaction if the debt is not returned plus fee. Flash loans are not inherently exploitative — they enable capital-efficient arbitrage, collateral swaps, and liquidations — but they do amplify the blast radius of a vulnerable DeFi protocol by making arbitrary capital available atomically.

Flash Loans Explained

G

Gas (Ethereum)

A unit that measures computational work in the Ethereum EVM, charged per opcode executed and priced in gwei (10⁻⁹ ETH). Gas serves as a Sybil-resistance mechanism and a real-time market for block space: during congestion, traders paying higher gas tips are included first, making gas optimization a direct lever on execution quality for on-chain trading strategies.

Gas Fees Explained

H

HFT (High-Frequency Trading)

A class of algorithmic trading characterized by very high order rates, extremely short holding periods (often milliseconds to minutes), and infrastructure optimized to operate at the physical limits of network latency. HFT profitability comes from reacting to market microstructure signals faster than competitors, making colocation, kernel bypass, and hardware timestamping prerequisites rather than optimizations.

HFT Basics
Hot Wallet

A private key that is loaded into a networked, online system and available for real-time signing of transactions. Hot wallets are necessary for any automated on-chain trading system but must be scoped to strict position limits and balance caps — the compromise of a hot wallet should be a quantified, tolerable loss event, not an existential one.

Wallet Types Explained
HugePages

2MB or 1GB memory pages that replace the standard 4KB page size, dramatically reducing TLB (Translation Lookaside Buffer) pressure for large working sets. An order book with 1GB of data needs 262,144 standard pages but only 512 HugePages — the entire working set fits in TLB, eliminating the 10–100 cycle penalty per TLB miss that compounds on every lookup.

Memory Tuning for Linux Latency
HSM (Hardware Security Module)

A tamper-evident, FIPS-certified hardware device that stores private key material in secure silicon and performs cryptographic operations internally so the key never leaves the device. HSMs enforce access control policies at the hardware level and produce an immutable audit log of every signing operation — non-negotiable in regulated custody and required for MPC threshold signing schemes in production.

Secret Management Basics

I

IRQ Affinity

A Linux configuration that pins hardware interrupt handling (NIC, storage) to specific CPUs via the smp_affinity mask in /proc/irq/N/smp_affinity. If a NIC IRQ lands on the same CPU as your trading thread it interrupts the hot path; if it lands on a cross-NUMA socket it causes remote memory access. Pinning IRQs to dedicated CPUs adjacent to — but separate from — trading cores eliminates both failure modes.

Linux Interrupts Basics
io_uring

A Linux kernel I/O interface introduced in 5.1 that uses shared ring buffers between kernel and user space to submit and complete I/O operations without per-operation syscalls, achieving near-zero overhead asynchronous I/O. For trading systems logging tick data or writing WAL records, io_uring can increase I/O throughput 3–5x versus epoll while reducing CPU utilization by eliminating redundant context switches.

Storage I/O and Linux Latency
Impermanent Loss

The opportunity cost incurred by a liquidity provider in an AMM pool compared to simply holding the underlying assets, caused by the AMM's constant-product rebalancing mechanism buying the depreciating asset as prices diverge. The loss is "impermanent" only if prices revert; in trending markets it is a realized drain on LP returns that market makers must model explicitly against fee income to determine whether providing liquidity is profitable.

Liquidity Pools

J

Jitter

Variance in latency — the difference between your best-case and worst-case response time — and the metric that kills trading performance more often than average latency. A system with 10µs average but 500µs P99.9 jitter will miss fills on 1 in 1,000 orders during fast markets; eliminating jitter sources (IRQs, scheduler preemption, GC pauses, THP compaction) is the primary goal of kernel tuning in HFT environments.

What Is Latency?

K

Kernel Bypass

Any technique that routes network I/O directly between the NIC and user-space memory without passing through the Linux kernel network stack, eliminating the 5+ context switches and multiple memory copies that normally cost 15–30µs per packet. DPDK, AF_XDP, and RDMA are the primary implementations; each trades away standard kernel tooling and TCP compatibility in exchange for deterministic low latency.

AF_XDP Kernel Bypass
KMS (Key Management Service)

A managed service (AWS KMS, GCP Cloud KMS, HashiCorp Vault) that stores cryptographic keys in HSM-backed hardware and exposes them only through authenticated API calls that log every use, so the raw key material never appears in application memory. In TEE-based architectures, the enclave's attestation document is presented to KMS as proof of identity before it will release a data key, creating a hardware-rooted trust chain.

Secret Management Basics

L

Latency

The elapsed time between an event occurring and your system reacting to it — not throughput, not bandwidth, but the wall-clock delay experienced by a single unit of work. In trading infrastructure, what matters is not average latency but the tail: P99 and P99.9 percentiles determine how often you miss fills during price moves, making latency measurement methodology as important as the optimization itself.

What Is Latency?
Liquidation

The forced closure of a leveraged position by a protocol or exchange when its collateral value falls below the required maintenance margin, typically triggering an on-chain auction or market sell to recover the outstanding debt. Liquidators in DeFi compete to be first to call the liquidation function — a race that rewards the fastest submitter with a bonus and has driven demand for MEV-aware execution infrastructure.

DeFi Risk Management
LVR (Loss-Versus-Rebalancing)

A rigorous financial metric that quantifies how much value AMM liquidity providers lose to informed order flow compared to a continuously rebalanced portfolio tracking the same asset mix — a more precise alternative to "impermanent loss" that accounts for the time value of rebalancing. LVR grows with price volatility and is the primary reason sophisticated market makers prefer off-chain CLOBs to passive AMM liquidity provision.

AMMs Explained

M

Market Making

A trading strategy that continuously posts resting limit orders on both sides of the order book, profiting from the bid-ask spread while providing liquidity to the market. Profitability depends on inventory risk management (being long or short too long is costly in volatile markets), adverse selection (losing to better-informed traders), and fill rate — all of which are functions of speed, model accuracy, and fee tier.

Market Making Introduction
MEV (Maximal Extractable Value)

Profit extracted by block producers or searchers through the ability to reorder, insert, or censor transactions within a block — including frontrunning, sandwich attacks, and liquidation capture. MEV is not a bug but an emergent property of transparent mempools and validator discretion; protecting against it requires private transaction routing (Flashbots Protect, MEV Blocker) or intent-based architectures like OFAs.

What Is MEV?
MPC (Multi-Party Computation)

A cryptographic protocol that distributes a private key across N parties such that any threshold t of them must collaborate to produce a valid signature, and no individual party ever holds or learns the full key. MPC-based custody eliminates the single point of failure of traditional key storage without the operational complexity of multisig on-chain — signatures are generated off-chain and appear identical to standard ECDSA signatures to the blockchain.

mTLS

Mutual TLS is an extension of standard TLS where both the client and server present and verify X.509 certificates, establishing bidirectional cryptographic identity before any data is exchanged. In microservice trading infrastructure, mTLS replaces shared secrets and IP-allowlist access controls with per-service certificates managed by a PKI, so a compromised service cannot impersonate another without its private key.

TLS/SSL Explained

N

NATS

A lightweight, high-performance open-source messaging system that supports pub/sub, request/reply, and queue worker patterns over a simple text protocol, with a server written in Go that fits in a single binary. NATS achieves sub-millisecond message delivery at millions of messages per second and is used in trading infrastructure as the nervous system between data feeds, strategy engines, risk systems, and execution gateways.

NUMA

Non-Uniform Memory Access describes multi-socket server architectures where each CPU socket has locally attached RAM that it accesses in ~70ns, while accessing the other socket's RAM costs ~130ns due to the cross-socket interconnect. In trading systems, NUMA-unaware deployments where a trading thread on socket 0 reads order book data from socket 1's memory pay a 60ns tax on every cache miss — trivial per access but catastrophic at scale.

Memory Tuning for Linux Latency
Nitro Enclave

AWS's implementation of Trusted Execution Environments using the Nitro hypervisor to carve out an isolated virtual machine with no persistent storage, no interactive access, and no external network — communicating with the parent EC2 instance only via VSOCK. The Nitro Security Chip produces a signed attestation document containing SHA-384 PCR measurements of the exact enclave image, enabling verifiable key release from AWS KMS.

Nitro Enclave: Sovereign Pod

O

Order Book

A real-time list of all outstanding resting buy (bid) and sell (ask) limit orders for an instrument, organized by price level and showing quantity available at each price. The order book is the primary source of market microstructure signal in HFT — imbalances between bid and ask depth predict short-term price direction, and reconstructing it with sub-millisecond accuracy from exchange feeds is a non-trivial engineering challenge.

Order Book Basics
OFA (Order Flow Auction)

A mechanism where user transaction intents are submitted to an off-chain auction before on-chain settlement, allowing solvers to compete to fill the order at the best price while routing any captured MEV back to the user rather than validators. OFAs like CoW Protocol and SUAVE represent a structural shift from permissioned block building to competitive, MEV-redistributing execution for retail and institutional DeFi traders.

SUAVE: Decentralized Block Building

P

P99 / P99.9

The 99th and 99.9th percentile latency values — the latency that 99% or 99.9% of requests fall under, representing tail behavior rather than average. P99.9 means 1 in 1,000 requests is slower; at 1 million orders per day that is 1,000 worst-case events. Profiling averages in trading systems is misleading; P99.9 is where slippage, missed fills, and risk system failures live.

What Is Latency?
PTP (Precision Time Protocol)

IEEE 1588 Precision Time Protocol synchronizes clocks across a network to sub-microsecond accuracy using hardware timestamping on switches and NICs to account for propagation delay. Without PTP (or its simpler cousin NTP which achieves only ~1ms accuracy), cross-system latency measurements are meaningless — you cannot compare a timestamp from one machine with a timestamp from another unless both are synchronized to the same nanosecond-precision clock.

PTP Time Synchronization
PREEMPT_RT

A Linux kernel patch set that converts most of the kernel's non-preemptible critical sections into preemptible ones, bounding worst-case interrupt latency to under 100µs on commodity hardware (versus potentially unbounded on a stock kernel). PREEMPT_RT is the standard kernel configuration for soft real-time trading systems on Linux that cannot justify DPDK's operational complexity but need predictable tail latency.

Hidden Linux Settings for Latency

R

RTT (Round-Trip Time)

The time for a message to travel from sender to receiver and for the acknowledgment to return — the fundamental latency unit for any request/response protocol. In exchange connectivity, RTT determines how quickly your system can learn the result of an order submission; a 10µs RTT advantage over a competitor translates directly to priority at the matching engine during periods of simultaneous order arrival.

What Is Latency?
Reorg

A blockchain reorganization occurs when the canonical chain tip is replaced by a competing fork of equal or greater cumulative work, causing transactions in the discarded blocks to revert as if they never happened. For trading systems, reorgs invalidate fills and can cause double-execution of orders if the system does not track finality depth — on Ethereum, waiting 12 confirmations (2.4 minutes) provides economic finality guarantees sufficient for most trading applications.

Blockchain Consensus Mechanisms

S

Signing Latency

The time required for a cryptographic signing operation to complete — a critical bottleneck in on-chain trading where every transaction must be signed before submission. Software ECDSA signing takes 70–200µs; HSM-based signing adds network round-trip overhead for 1–5ms total; TEE-based signing inside an enclave with a pre-loaded key achieves 50–100µs while preserving hardware isolation guarantees.

Nitro Enclave: Sovereign Pod
Slippage

The difference between the expected fill price when an order is placed and the actual price at which it executes, caused by market movement during transmission latency or by walking the order book when order size exceeds available liquidity at the best price. Slippage is not random error — it is a quantifiable function of order size, spread, depth, and fill latency, and should be explicitly modeled in every strategy's cost basis.

Trading Order Types
SoftIRQ

Linux's deferred interrupt processing mechanism that runs network packet processing, timer callbacks, and block I/O completion in a software interrupt context after the hardware IRQ has acknowledged the device. SoftIRQ processing competes with application threads for CPU time unless explicitly managed via IRQ affinity — on an untuned kernel, SoftIRQ storms during packet bursts can introduce 50–200µs jitter spikes on co-located CPUs.

Linux Interrupts Basics
StatefulSet

A Kubernetes workload controller for applications that require stable network identity and persistent storage across pod restarts, unlike Deployments which treat pods as interchangeable. Trading system components like order management systems, NATS JetStream nodes, and time-series databases use StatefulSets so that each replica retains its identity, storage volume, and sequence position after rescheduling.

Kubernetes StatefulSets for Trading

T

TEE (Trusted Execution Environment)

A hardware-enforced isolated computation environment where code runs in encrypted memory that is inaccessible to the host OS, hypervisor, or cloud operator, and where the CPU generates a cryptographic attestation proof of exactly what code is executing. TEEs enable verifiable, operator-proof execution of trading logic and key management — a counterparty can cryptographically verify your system before trusting it with capital, without you revealing source code.

Trusted Execution Environments
THP (Transparent Huge Pages)

A Linux kernel feature that automatically promotes 4KB pages to 2MB huge pages without application changes, aiming to reduce TLB pressure transparently. In trading systems, THP is almost universally disabled because its background compaction daemon (khugepaged) causes unpredictable latency spikes of 1–10ms while defragmenting memory — the exact opposite of what a low-jitter trading system needs.

Memory Tuning for Linux Latency
TLB (Translation Lookaside Buffer)

A small, fast hardware cache inside the CPU that stores recent virtual-to-physical address translations, avoiding a full page table walk (which costs 100–300 cycles) on every memory access. TLB capacity is fixed — typically 64 entries for large pages and 1,500 for small pages — making TLB pressure a primary driver of memory access latency in trading systems with large order books or market data caches.

Linux Memory Management
TWAP

Time-Weighted Average Price is an execution algorithm that splits a large order into smaller child orders spread evenly over a time horizon, aiming to match the average price over that period rather than moving the market with a single large trade. TWAP is the canonical institutional execution algorithm for minimizing market impact on large orders where predictable, low-slippage fills matter more than execution speed.

V

VSOCK

Virtual Socket is a Linux communication mechanism that allows a guest VM (or Nitro Enclave) to communicate with its hypervisor host over a local, high-throughput channel without a network interface, TCP stack, or external IP address. AWS Nitro Enclaves use VSOCK as the only permitted communication channel between the enclave and its parent EC2 instance, enforcing the enclave's network isolation guarantee.

Nitro Enclave: Sovereign Pod
Validator

In Ethereum's Proof-of-Stake consensus, a node that has deposited 32 ETH and participates in block proposal and attestation duties to extend the canonical chain, earning rewards for honest participation and facing slashing penalties for equivocation. Validators are the MEV extraction point in Ethereum post-Merge because they select which transactions to include and can accept bids from block builders via MEV-Boost to maximize their returns.

Blockchain Consensus Mechanisms

W

WAL (Write-Ahead Log)

A durability technique where every database change is first recorded sequentially to an append-only log on disk before being applied to the main data structure, ensuring the database can replay the log to recover a consistent state after a crash. SQLite in WAL mode allows concurrent readers while a writer appends — critical for trading systems where a risk monitor must read position state while the OMS is writing fills.

Z

Zero-Copy

A data transfer technique where the CPU moves data from source to destination without copying it through intermediate kernel or user-space buffers — the data travels directly from NIC DMA to application memory (or from application memory to NIC DMA). Zero-copy eliminates both the memory bandwidth consumption and the CPU cycles of redundant copies, which at 10Gbps line rates represents millions of wasted bytes per second.

Rust Zero-Copy Serialization
Zero Trust

A security architecture that assumes no implicit trust based on network location — every request from every service must be authenticated, authorized, and encrypted regardless of whether it originates inside or outside a perimeter. In trading infrastructure this means mTLS between all microservices, short-lived credentials managed by Vault or AWS IAM Roles, and continuous verification rather than perimeter firewall reliance.

What Is Zero Trust?

Learn one term per week, in depth

The Infrastructure Edge newsletter.

Free forever. Unsubscribe anytime.

You're in. Check your inbox.