Architecting a Hybrid Data Gateway with FastAPI for Processing and Fastify for I/O Delivery

Architecture

Word Count: 2.8k

Read Times: 17 Min

The technical requirement was deceptively simple: construct a real-time dashboard that ingests a high-throughput stream of financial market data, applies a set of complex, computationally-intensive validation and enrichment rules, and broadcasts the results to several hundred concurrent web clients with sub-50ms latency. The data originates from an upstream message queue, and the validation rules depend on a proprietary Python library maintained by a quantitative analysis team. A pure stack approach immediately presented a series of trade-offs.

A solution built entirely in Python with FastAPI would excel at integrating the quant library and leveraging Pydantic for rigorous data validation. However, managing hundreds of persistent WebSocket connections, a fundamentally I/O-bound task, is not Python’s strongest suit, even with asyncio. The Global Interpreter Lock (GIL) remains a consideration, and the memory footprint for that many connections can become a concern.

Conversely, a pure Node.js stack using Fastify would be exceptional for the I/O-bound workload. Node’s event loop is designed for exactly this scenario: efficiently managing a large number of concurrent, non-blocking network connections. The challenge, however, would be the CPU-bound data processing. While Node.js is fast, porting the complex Python validation library or shelling out to a Python process would introduce significant performance overhead, IPC complexity, and maintenance headaches. Neither pure approach felt optimal.

The final architectural decision was a hybrid model: a distributed system composed of two specialized microservices communicating via gRPC.

FastAPI Processing Service: A Python service responsible for a single task: consuming raw data, applying the complex validation and enrichment rules using the existing quant libraries, and exposing this logic via a gRPC endpoint. This service plays to Python’s strengths in data manipulation and its rich scientific computing ecosystem.
Fastify Gateway Service: A Node.js service acting as the client-facing edge. It manages all WebSocket connections, serves the frontend static assets, and acts as a gRPC client to the FastAPI service. It handles the I/O-heavy lifting, orchestrating the flow of data from clients to the processing backend and back.

This separation of concerns allows each component to operate in its ideal runtime environment. For the frontend development experience, where rapid iteration on a complex UI is critical, Turbopack was selected to power the Next.js development server, ensuring near-instant feedback loops.

graph TD
    subgraph Browser
        A[Client UI - React/Next.js]
    end

    subgraph "Node.js Runtime"
        B[Fastify Gateway Service]
    end

    subgraph "Python Runtime"
        D[FastAPI Processing Service]
    end

    subgraph "Data Sources"
        E[Message Queue e.g., Kafka/NATS]
    end

    A -- WebSocket --> B
    B -- gRPC Request --> D
    D -- Fetches Data --> E
    E -- Raw Data --> D
    D -- gRPC Response --> B
    B -- WebSocket Push --> A

    style B fill:#333,stroke:#66f,stroke-width:2px
    style D fill:#333,stroke:#f90,stroke-width:2px

The communication backbone for this architecture is Protocol Buffers (Protobuf) and gRPC. This choice is deliberate. A RESTful API with JSON would introduce significant serialization/deserialization overhead and lacks the strong typing and streaming capabilities inherent to gRPC, which are critical for maintaining low latency.

The first step is defining the service contract. This .proto file is the canonical source of truth for all inter-service communication.

proto/market_data.proto

syntax = "proto3";

package marketdata;

// The core service definition for processing market data.
service MarketDataProcessor {
  // A bidirectional stream to process incoming ticks and send back validated trades.
  // This allows for a persistent connection between the gateway and the processor,
  // minimizing connection setup overhead for a continuous stream of data.
  rpc ProcessStream(stream RawMarketTick) returns (stream ValidatedTrade);
}

// Represents a raw data point from the upstream feed.
message RawMarketTick {
  string instrument_id = 1;
  double price = 2;
  int64 volume = 3;
  int64 timestamp_ns = 4; // Nanosecond precision timestamp
}

// Represents a validated and enriched trade object ready for client consumption.
message ValidatedTrade {
  string trade_id = 1;
  string instrument_id = 2;
  double price = 3;
  int64 volume = 4;
  int64 timestamp_ns = 5;
  ValidationStatus status = 6;
  optional string reason = 7; // Optional reason if validation fails
  double vwap_20m = 8; // Example enrichment: 20-minute Volume-Weighted Average Price
}

enum ValidationStatus {
  UNKNOWN = 0;
  VALID = 1;
  INVALID = 2;
}

With the contract defined, the next step is to generate the necessary server and client code for both Python and Node.js.

# Python code generation
python -m grpc_tools.protoc \
  -I./proto \
  --python_out=./processor_service/generated \
  --pyi_out=./processor_service/generated \
  --grpc_python_out=./processor_service/generated \
  ./proto/market_data.proto

# Node.js (TypeScript) code generation
# Using @grpc/grpc-js and @grpc/proto-loader with ts-proto for better TS support
npx protoc \
  --plugin=protoc-gen-ts_proto=./gateway_service/node_modules/.bin/protoc-gen-ts_proto \
  --ts_proto_out=./gateway_service/src/generated \
  --ts_proto_opt=outputServices=grpc-js,env=node,useOptionals=messages \
  -I ./proto \
  ./proto/market_data.proto

The FastAPI Processing Service

This service is the computational core. It’s built with grpcio and fastapi (though FastAPI’s web server component isn’t used here, its ecosystem, especially Pydantic, is invaluable for internal data modeling). The main focus is on the gRPC server implementation.

processor_service/main.py

import asyncio
import logging
from concurrent import futures
from typing import AsyncIterator

import grpc
from pydantic import BaseModel, Field

# Import generated gRPC code
from generated import market_data_pb2
from generated import market_data_pb2_grpc

# Configure structured logging
logging.basicConfig(
    level=logging.INFO,
    format='%(asctime)s - %(name)s - %(levelname)s - %(message)s'
)
logger = logging.getLogger(__name__)

# --- Pydantic Models for Internal Validation ---
# Even with Protobuf, using Pydantic internally ensures another layer
# of robust validation and business logic encapsulation.
class TickModel(BaseModel):
    instrument_id: str = Field(..., min_length=3, max_length=10)
    price: float = Field(..., gt=0)
    volume: int = Field(..., gt=0)
    timestamp_ns: int

class TradeModel(BaseModel):
    trade_id: str
    instrument_id: str
    price: float
    volume: int
    timestamp_ns: int
    status: market_data_pb2.ValidationStatus
    reason: str | None = None
    vwap_20m: float | None = None

# --- Mock Business Logic ---
# In a real-world project, this would be the complex quant library.
# Here we simulate some validation and enrichment.
class BusinessLogicEngine:
    async def validate_and_enrich(self, tick: TickModel) -> TradeModel:
        # Simulate a CPU-intensive or IO-bound task
        await asyncio.sleep(0.01) # Simulates DB lookup or computation
        
        status = market_data_pb2.VALID
        reason = None
        
        # Example validation rule: price must be within a certain range
        if tick.price > 10000 or tick.price < 0.01:
            status = market_data_pb2.INVALID
            reason = "Price out of acceptable range"

        # Example enrichment: calculate a mock VWAP
        # In reality, this would involve complex stateful calculations.
        vwap = tick.price * 1.05

        return TradeModel(
            trade_id=f"trade_{tick.timestamp_ns}",
            instrument_id=tick.instrument_id,
            price=tick.price,
            volume=tick.volume,
            timestamp_ns=tick.timestamp_ns,
            status=status,
            reason=reason,
            vwap_20m=vwap if status == market_data_pb2.VALID else None
        )

engine = BusinessLogicEngine()

# --- gRPC Servicer Implementation ---
class MarketDataProcessorServicer(market_data_pb2_grpc.MarketDataProcessorServicer):
    async def ProcessStream(
        self,
        request_iterator: AsyncIterator[market_data_pb2.RawMarketTick],
        context: grpc.aio.ServicerContext
    ) -> AsyncIterator[market_data_pb2.ValidatedTrade]:
        
        client_peer = context.peer()
        logger.info(f"Bidirectional stream opened with client: {client_peer}")
        
        try:
            async for tick_proto in request_iterator:
                try:
                    # Convert protobuf message to Pydantic model for validation
                    tick_model = TickModel(
                        instrument_id=tick_proto.instrument_id,
                        price=tick_proto.price,
                        volume=tick_proto.volume,
                        timestamp_ns=tick_proto.timestamp_ns
                    )
                    
                    logger.debug(f"Received valid tick model: {tick_model.instrument_id}")

                    # Execute the core business logic
                    validated_trade_model = await engine.validate_and_enrich(tick_model)

                    # Convert the result back to a protobuf message and yield it
                    # This sends the message back to the client over the stream.
                    yield market_data_pb2.ValidatedTrade(
                        trade_id=validated_trade_model.trade_id,
                        instrument_id=validated_trade_model.instrument_id,
                        price=validated_trade_model.price,
                        volume=validated_trade_model.volume,
                        timestamp_ns=validated_trade_model.timestamp_ns,
                        status=validated_trade_model.status,
                        reason=validated_trade_model.reason,
                        vwap_20m=validated_trade_model.vwap_20m,
                    )

                except Exception as e:
                    # Handle Pydantic validation errors or other processing exceptions
                    logger.error(f"Error processing tick: {e}", exc_info=True)
                    # Send an error message back to the gateway
                    yield market_data_pb2.ValidatedTrade(
                        instrument_id=tick_proto.instrument_id,
                        status=market_data_pb2.INVALID,
                        reason=f"Processing error: {str(e)}",
                        timestamp_ns=tick_proto.timestamp_ns,
                    )
        except grpc.aio.AioRpcError as e:
            logger.warning(f"Client stream closed unexpectedly: {e.details()}")
        finally:
            logger.info(f"Bidirectional stream closed with client: {client_peer}")


async def serve() -> None:
    server = grpc.aio.server(futures.ThreadPoolExecutor(max_workers=10))
    market_data_pb2_grpc.add_MarketDataProcessorServicer_to_server(
        MarketDataProcessorServicer(), server
    )
    listen_addr = '[::]:50051'
    server.add_insecure_port(listen_addr)
    logger.info(f"Starting gRPC server on {listen_addr}")
    await server.start()
    await server.wait_for_termination()

if __name__ == '__main__':
    asyncio.run(serve())

A crucial part of production code is testing. Here’s a small test for our gRPC service using grpc-testing.

processor_service/test_processor.py

import asyncio
import pytest
import grpc
from unittest.mock import patch, AsyncMock

from main import MarketDataProcessorServicer
from generated import market_data_pb2

@pytest.mark.asyncio
async def test_process_stream_valid_tick():
    servicer = MarketDataProcessorServicer()

    # Create a mock async iterator for the request
    async def request_generator():
        yield market_data_pb2.RawMarketTick(
            instrument_id="TEST.TICK",
            price=150.5,
            volume=100,
            timestamp_ns=1234567890
        )

    mock_context = AsyncMock(spec=grpc.aio.ServicerContext)
    
    # Mock the business logic to have a predictable outcome
    with patch('main.engine.validate_and_enrich', new_callable=AsyncMock) as mock_enrich:
        mock_enrich.return_value.status = market_data_pb2.VALID
        mock_enrich.return_value.instrument_id = "TEST.TICK"
        mock_enrich.return_value.vwap_20m = 158.0

        response_iterator = servicer.ProcessStream(request_generator(), mock_context)
        
        # Get the first (and only) result
        result = await anext(response_iterator)

        assert result.status == market_data_pb2.VALID
        assert result.instrument_id == "TEST.TICK"
        assert result.vwap_20m > 157.0

@pytest.mark.asyncio
async def test_process_stream_invalid_price():
    servicer = MarketDataProcessorServicer()

    async def request_generator():
        yield market_data_pb2.RawMarketTick(
            instrument_id="BAD.TICK",
            price=99999.99, # This should fail validation
            volume=50,
            timestamp_ns=987654321
        )
    
    mock_context = AsyncMock(spec=grpc.aio.ServicerContext)
    response_iterator = servicer.ProcessStream(request_generator(), mock_context)
    result = await anext(response_iterator)

    assert result.status == market_data_pb2.INVALID
    assert "Price out of acceptable range" in result.reason

The Fastify Gateway Service

This is the Node.js/TypeScript component that faces the clients. It uses Fastify for its raw performance, @grpc/grpc-js to communicate with the Python service, and fastify-websocket to manage client connections.

gateway_service/src/server.ts

import Fastify from 'fastify';
import websocketPlugin from '@fastify/websocket';
import staticPlugin from '@fastify/static';
import path from 'path';
import { credentials, Metadata } from '@grpc/grpc-js';
import { MarketDataProcessorClient } from './generated/market_data';
import { RawMarketTick, ValidatedTrade } from './generated/market_data';
import { GrpcTransport } from '@protobuf-ts/grpc-transport';
import { pino } from 'pino';

const logger = pino({ level: 'info' });

const GRPC_SERVER_ADDRESS = process.env.GRPC_SERVER_ADDRESS || 'localhost:50051';

// --- gRPC Client Setup ---
// Establishes a persistent connection to the Python processing service.
const transport = new GrpcTransport({
    host: GRPC_SERVER_ADDRESS,
    channelCredentials: credentials.createInsecure(),
});
const grpcClient: MarketDataProcessorClient = new MarketDataProcessorClient(transport);

const fastify = Fastify({ logger });

// --- Plugin Registration ---
fastify.register(websocketPlugin);
// Serve the static frontend assets (built by Turbopack/Next.js)
fastify.register(staticPlugin, {
  root: path.join(__dirname, '../public'), // Assuming a 'public' folder after build
  prefix: '/',
});

fastify.get('/health', async (request, reply) => {
  return { status: 'ok' };
});

// --- WebSocket Handling Logic ---
fastify.register(async function (fastifyInstance) {
  fastifyInstance.get('/ws/marketdata', { websocket: true }, (connection, req) => {
    logger.info('Client connected via WebSocket.');

    const metadata = new Metadata();
    metadata.set('client-id', req.id);
    
    // Establish the bidirectional gRPC stream for this WebSocket connection
    const grpcStream = grpcClient.processStream(metadata);

    // --- Data flow: gRPC -> WebSocket ---
    // Listen for messages from the Python service
    grpcStream.responses.on("data", (trade: ValidatedTrade) => {
      // A common mistake is to not handle the case where the websocket might be closing.
      // Always check the readyState.
      if (connection.socket.readyState === connection.socket.OPEN) {
        // Serialize the validated trade and send it to the web client
        connection.socket.send(JSON.stringify(trade));
      }
    });

    grpcStream.responses.on("error", (err: Error) => {
        logger.error({ err }, 'gRPC response stream error.');
        connection.socket.close(1011, 'Backend processing error');
    });

    grpcStream.responses.on("end", () => {
        logger.info('gRPC response stream ended.');
        connection.socket.close();
    });

    // --- Data flow: WebSocket -> gRPC ---
    // Listen for messages from the web client
    connection.socket.on('message', (message: string) => {
      try {
        const data = JSON.parse(message);
        
        // A real-world project would have robust validation here (e.g., using Zod)
        // before sending data to the gRPC service.
        const tick: RawMarketTick = {
          instrumentId: data.instrumentId,
          price: data.price,
          volume: data.volume,
          timestampNs: BigInt(Date.now()) * BigInt(1000000), // Create nanosec timestamp
        };

        // Write the tick into the gRPC stream to be processed by Python.
        grpcStream.requests.send(tick);

      } catch (err) {
        logger.warn({ err, msg: message.toString() }, 'Received invalid WebSocket message.');
      }
    });

    connection.socket.on('close', (code, reason) => {
      logger.info({ code, reason: reason.toString() }, 'Client WebSocket closed.');
      // It's critical to clean up the gRPC stream when the client disconnects.
      // Calling 'sendComplete' signals the Python service that this client is done.
      grpcStream.requests.sendComplete();
    });

    connection.socket.on('error', (err) => {
        logger.error({ err }, 'WebSocket connection error.');
        grpcStream.requests.sendComplete(); // Clean up on error
    });
  });
});

// --- Server Start ---
const start = async () => {
  try {
    await fastify.listen({ port: 3000, host: '0.0.0.0' });
  } catch (err) {
    fastify.log.error(err);
    process.exit(1);
  }
};

start();

Frontend Integration with Turbopack

While the backend architecture is complex, the frontend development workflow must remain fast. Using Next.js with Turbopack (next dev --turbo) provides an almost instantaneous development server. The key configuration is setting up a proxy to channel WebSocket requests to the Fastify backend during development, avoiding CORS issues.

gateway_service/next.config.js

/** @type {import('next').NextConfig} */
const nextConfig = {
  // In a real project, this would be more robust,
  // likely checking process.env.NODE_ENV
  async rewrites() {
    return [
      {
        source: '/ws/marketdata',
        destination: 'http://localhost:3000/ws/marketdata', // Proxy to Fastify
      },
    ];
  },
};

module.exports = nextConfig;

A minimal React component demonstrates the client-side logic.

gateway_service/app/page.tsx

"use client";

import { useState, useEffect, useRef } from 'react';

// Define the shape of the data we expect from the WebSocket
interface ValidatedTrade {
  tradeId: string;
  instrumentId: string;
  price: number;
  volume: string; // BigInt comes as string in JSON
  status: "VALID" | "INVALID";
  reason?: string;
  vwap20m?: number;
}

export default function Home() {
  const [trades, setTrades] = useState<ValidatedTrade[]>([]);
  const [connectionStatus, setConnectionStatus] = useState('Disconnected');
  const ws = useRef<WebSocket | null>(null);

  useEffect(() => {
    // Construct the WebSocket URL based on the window location.
    // This makes it work in both development (http) and production (https -> wss).
    const wsProtocol = window.location.protocol === 'https:' ? 'wss:' : 'ws:';
    const wsUrl = `${wsProtocol}//${window.location.host}/ws/marketdata`;

    ws.current = new WebSocket(wsUrl);

    ws.current.onopen = () => {
      setConnectionStatus('Connected');
      // Simple mock message sender to test the pipeline
      setInterval(() => {
        if (ws.current?.readyState === WebSocket.OPEN) {
          const mockTick = {
            instrumentId: "BTC/USD",
            price: 50000 + (Math.random() - 0.5) * 100,
            volume: Math.floor(Math.random() * 10) + 1,
          };
          ws.current.send(JSON.stringify(mockTick));
        }
      }, 2000);
    };

    ws.current.onclose = () => setConnectionStatus('Disconnected');
    ws.current.onerror = () => setConnectionStatus('Error');

    ws.current.onmessage = (event) => {
      try {
        const trade: ValidatedTrade = JSON.parse(event.data);
        setTrades(prevTrades => [trade, ...prevTrades.slice(0, 99)]); // Keep last 100 trades
      } catch (error) {
        console.error("Failed to parse WebSocket message", error);
      }
    };

    // Cleanup function to close the WebSocket connection on component unmount
    return () => {
      ws.current?.close();
    };
  }, []);

  return (
    <main>
      <h1>Real-Time Market Data</h1>
      <p>Connection Status: {connectionStatus}</p>
      <table>
        <thead>
          <tr>
            <th>ID</th>
            <th>Instrument</th>
            <th>Price</th>
            <th>Status</th>
            <th>Reason</th>
          </tr>
        </thead>
        <tbody>
          {trades.map((trade) => (
            <tr key={trade.tradeId} style={{ color: trade.status === 'INVALID' ? 'red' : 'green' }}>
              <td>{trade.tradeId.slice(-8)}</td>
              <td>{trade.instrumentId}</td>
              <td>{trade.price.toFixed(2)}</td>
              <td>{trade.status}</td>
              <td>{trade.reason || 'N/A'}</td>
            </tr>
          ))}
        </tbody>
      </table>
    </main>
  );
}

This hybrid architecture, while introducing the operational complexity of a distributed system, solves the initial problem effectively. It leverages the best tool for each specific job: Python for its mature data science ecosystem and FastAPI/Pydantic for robust data modeling, Node.js with Fastify for its world-class I/O performance, and Turbopack for maintaining developer velocity on the frontend.

The key limitation of this pattern is the added architectural surface area. Debugging now requires distributed tracing to follow a request from the client’s WebSocket message through the Fastify gateway, across the gRPC boundary to the FastAPI service, and back. The Protobuf contract becomes a critical dependency that requires careful versioning and management. Furthermore, the system is now subject to network partitioning between the services, necessitating robust retry and circuit-breaker logic in the Fastify gRPC client for true production resilience, which has been omitted here for clarity. This approach is not a universal solution, but for problems that sit at the intersection of heavy I/O and heavy computation, it presents a pragmatic and highly performant compromise.

Python gRPC FastAPI Fastify Turbopack Microservices Node.js

Building a Resilient MLOps Compute Plane Using Ray Cluster Discovery with Consul and State Management with CockroachDB

2023-10-27 MLOps

Distributed Systems Ray Consul CockroachDB MLOps

Constructing an Immutable ML Serving Image with Packer, BentoML, and an Embedded LevelDB Metadata Cache

2023-10-27 MLOps

Packer BentoML MLOps LevelDB Pinecone Scikit-learn Immutable Infrastructure