Performance Test

Prev Next

This documentation provides a detailed summary of the performance tests conducted on our product, capturing various metrics across different threading configurations and segments.

image.png

Sample Dialogue Durations:

Short Segments (in seconds)

Threads 1 10 50
Dialogue Duration 496.0 602.0 1845.0
Talking Duration 191.64 191.64 191.64
Playing Audio Duration 191.64 191.64 191.64
Translation Duration 111.0 223.0 1458.0
Average Segment Duration 3.6 3.6 3.6
Average Translation Duration 2.1 4.2 27.5

Long Segments (in seconds)

Threads 1 10 50
Dialogue Duration 477.0 478.0 997.0
Talking Duration 210.0 210.0 210.0
Playing Audio Duration 210.0 210.0 210.0
Translation Duration 57.0 58.0 577.0
Average Segment Duration 10.5 10.5 10.5
Average Translation Duration 2.85 2.9 28.85

Average Durations for 100 Dialogues:

Short Segments (in seconds)

Threads 1 10 50
Average Dialogue Duration 467.53 541.08 1464.91
Average Segment Duration 4.10 4.10 4.10
Average Translation Duration 2.15 3.75 24.17

Long Segments (in seconds)

Threads 1 10 50
Average Dialogue Duration 539.79 541.39 1139.21
Average Segment Duration 11.95 11.95 11.95
Average Translation Duration 3.09 3.17 33.06

Segment Statistics

Min Avg Max
Short Segments 1 4 29
Long Segments 5 12 19

Comparable Summary

1 Thread – 200 Samples 10 Thread – 20 Samples
Case Throughput Average Delay
All Pipeline 0.21 4875.53
Only Whisper 0.49 2038.53
Only Whisper (Params Changed) 0.49 2045.80
Only Google Translate 3.16 315.86
Only Azure TTS 3.11 320.76

Note: In tests labeled "Only," the actions like network-based audio input/output and saving audio to disk are not disabled; hence, they are not performing only inference.

Two Server Tests

Server1 Server2
Test Omp_num_threads Capacity Limiter
(5 threads – 20 samples) x 2 4 4

All Pipeline Tests

Omp_num_threads Capacity Limiter Throughput Average Delay
1 thread – 200 samples 4 4 0.21 4875.53
5 threads – 40 samples 4 4 0.49 9935.84
10 threads – 20 samples 4 4 0.50 19451.39
1 thread – 200 samples 8 8 0.18 5443.84
5 threads – 40 samples 8 8 0.51 9406.90
10 threads – 20 samples 8 8 0.54 18050.80

All Pipeline - Whisper Param Changed

Omp_num_threads Capacity Limiter Throughput Average Delay
1 thread – 200 samples 4 4 0.20 4926.17
5 threads – 40 samples 4 4 0.59 8296.78
10 threads – 20 samples 4 4 0.55 17268.97

Only Whisper Tests

Omp_num_threads Capacity Limiter Throughput Average Delay
1 thread – 200 samples 4 4 0.49 2038.53
5 threads – 40 samples 4 4 0.55 8685.92
10 threads – 20 samples 4 4 0.54 18019.95

Model Parameters Changed

  • cpu_threads=os.environ["OMP_NUM_THREADS"]
  • num_workers=os.environ["OMP_NUM_THREADS"]
Omp_num_threads Capacity Limiter Throughput Average Delay
1 thread – 200 samples 4 4 0.49 2045.80
5 threads – 40 samples 4 4 0.71 6893.10
10 threads – 20 samples 4 4 0.70 13391.11

Only Google Translate

Omp_num_threads Capacity Limiter Throughput Average Delay
1 thread – 200 samples 4 4 3.16 315.86
5 threads – 40 samples 4 4 9.59 476.84
10 threads – 20 samples 4 4 918.03

Only Azure TTS

Omp_num_threads Capacity Limiter Throughput Average Delay
1 thread – 200 samples 4 4 3.11 320.76
5 threads – 40 samples 4 4 10.13 468.56
10 threads – 20 samples 4 4 9.95 950.62