Higher latencies with NNAPI on Snapdragon 888

anamika · Apr 20, 2021

I was looking at the ranking available on your website-
https://ai-benchmark.com/ranking_detailed.html and observed that on
Snapdragon 888 for multiple models, NNAPI shows better latency numbers
compared to CPU for both int and float models.
Recently, I tested some DL models on Snapdragon 888 using TfLite
framework on CPU and NNAPI, but observed higher latency numbers with
NNAPI compared to CPU for FP16 and A16W8 models.
I read in TfLite documentation that fallback to CPU is disabled by
default for Android 10 (API level 29). The 888 device I use has a
higher Android version (11) and API level (30).
Still I am doubtful if NNAPI is being correctly utilized, due to the
latency numbers.
I have 2 questions-
1. Are there any drivers that are required by NNAPI, to work
correctly? If yes, can you please specify?
2. In TfLite, how to verify if the delegates are invoked correctly?

Andrey Ignatov · Apr 21, 2021

anamika said:
Are there any drivers that are required by NNAPI, to work
correctly? If yes, can you please specify?

Yes and no. NNAPI HAL is a part of Android firmware image. You cannot install these drivers separately, they are either integrated by the corresponding smartphone vendor or not. If you are working with some prototype devices or boards, it is likely that NN HAL is missing there since SDM888 is a new chipset.

anamika said:
In TfLite, how to verify if the delegates are invoked correctly?

There is some functionality, you can find its description here.

anamika · Apr 22, 2021

Thank you so much for the information.
I will check more about the device and the functionalities mentioned in the shared link.

Higher latencies with NNAPI on Snapdragon 888

anamika

New member

Andrey Ignatov

Administrator

anamika

New member