What are the key performance statistics of the system?

Question

Answers ( 1 )

    0
    2025-03-31T18:39:41+00:00

    Key performance statistics include:
    - Total input tokens: 608B (56.3% cache hit rate).
    - Total output tokens: 168B.
    - Average output speed: 20-22 tokens per second.
    - Throughput per H800 node: 73.7k input tokens per second (prefill), 14.8k output tokens per second (decode).
    - Daily cost: $87,072 (peak nodes: 278, average nodes: 226.75).
    - Theoretical daily revenue: $562,027, with a profit margin of 545%.

Leave an answer