AF SDK concurrency question

I recently encountered a performance issue with the AF SDK while working on a project, and I wanted to reach out to the community for insights and suggestions.

 

I have created sample code (available at https://github.com/seeq12/AFProbe) to investigate the concurrency behavior of the AF SDK. With this code, retrieving samples for 4000 attributes on 64 threads completes in 900 s, while loading samples for the same 4000 attributes on a single thread takes 3000 s. This points to a concurrency bottleneck: the speedup is only 3000 / 900 ≈ 3.3, as if no more than about 3.3 requests are being served in parallel.
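The 3.3 figure follows directly from the two timings above:

```python
# Wall-clock times reported above for retrieving samples for 4000 attributes.
single_thread_s = 3000   # one thread
multi_thread_s = 900     # 64 threads

# The speedup is the effective number of requests served in parallel.
effective_parallelism = single_thread_s / multi_thread_s
print(round(effective_parallelism, 1))  # -> 3.3
```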

 

In an attempt to address this, I increased the PIArchss_ThreadCount parameter from 32 to 64 (our data archive servers have 32 cores). Unfortunately, this change did not produce the expected improvement.

 

I'm reaching out to the community for guidance. Are there other parameters or configurations that could improve the concurrency performance of the AF SDK? I want to make sure I'm getting the most out of the system.

 

Additionally, if there are any known limitations or factors outside the scope of configuration that may impact AF SDK concurrency, I would be grateful for any insights on that front as well.

 

Thank you for your time and assistance.

  • First, let's start with some background reading. This link is the granddaddy of them all regarding:

    AF SDK performance, serial vs. parallel vs. bulk

     

    Next, I would strongly emphasize that you should not change the PI tuning parameters just to make your app seem faster. I adhere to the philosophy that the PI Server is tuned the way it is for a reason, and that I need to adjust my app to work with the PI Server as it is, not the other way around.

     

    Other guidelines of my philosophy:

    • Do NOT overwhelm a server with my app, since it may choke out other users or, more importantly, other data collection services.
    • Prefer bulk calls over per-tag calls.
    • Where justified, consider parallel or parallel + Bulk calls.
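    To put the bulk-versus-per-tag guideline in concrete terms, here is a hypothetical round-trip count. The 4000-attribute figure comes from the question above; the page size of 1000 is purely an illustrative assumption, not an AF SDK constant:

```python
import math

attributes = 4000   # attribute count from the original question
page_size = 1000    # hypothetical bulk page size, for illustration only

per_tag_round_trips = attributes                      # one request per tag
bulk_round_trips = math.ceil(attributes / page_size)  # paged bulk requests

print(per_tag_round_trips, bulk_round_trips)  # -> 4000 4
```

    Every round trip pays the network latency cost again, which is why collapsing thousands of per-tag calls into a handful of bulk calls dominates any other tuning.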

     

    There was a presentation at vCampus Live! in the early 2010s. For parallel calls, the main thought is to take advantage of "transit time", a.k.a. network latency. It takes time to send a remote request and get a response back. You want to keep the client busy almost constantly processing data coming back, not sitting idle waiting on transit time.

    With that in mind, and given that you do not want to choke out a PI Server, the guidance is to use a MaxDegreeOfParallelism of 3 for PI Servers. That applies to fetching data.
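    One way to see where a small number like 3 can come from: if most of a request's wall-clock time is transit, only a few requests need to be in flight to keep the client constantly busy. A rough back-of-the-envelope model (the 200 ms and 100 ms figures are made-up illustrative numbers, not measurements):

```python
import math

transit_ms = 200     # hypothetical round-trip network latency
process_ms = 100     # hypothetical client-side time to process one response

# While the client processes one response, enough requests should already be
# in transit that the next response arrives the moment the client is free.
in_flight = math.ceil((transit_ms + process_ms) / process_ms)
print(in_flight)  # -> 3
```

    Past that point, extra parallel requests mostly queue up on the server rather than speed up the client.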

     

    One good thing about parallel calls is that it can potentially improve the performance of your app. One bad thing about parallel calls is that it can potentially degrade the performance of your app. It all boils down to what the app is doing and how you issue parallel calls.

     

    There are just so many factors specific to your PI environment and your app that it is hard to give a one-size-fits-all answer. Some users may have 1 million tags with 24 readings per tag per day. Another user may have 10K tags with 86.4K readings per tag per day (i.e., one every second). Yet another user may have a mix of sparse and dense data, and that user is the most problematic.

     

  • Thank you for your response.

     

    We do use parallel requests, so I don't believe transit time is the primary limitation here.

     

    Unfortunately, we are unable to make bulk requests at the moment, as it would require architectural changes in multiple modules of the enterprise application. However, we will consider this in a future version.

     

    What is the reasoning behind setting MaxDegreeOfParallelism to 3? Is it solely based on my experimental results, or is there a different rationale behind it?

  • (Q) What is the reasoning behind setting MaxDegreeOfParallelism to 3? 

    (A) This was explicitly discussed in my previous answer, but let me repeat: you do NOT want to overwhelm the PI Server with so many requests that it chokes out all other services or users. The value of 3 has been an OSIsoft recommendation for over a decade.
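    In client code, a cap like this is just a bounded worker pool. A minimal sketch of throttling concurrent fetches to 3; the `fetch` stub below is purely hypothetical and stands in for whatever per-attribute call the app actually makes:

```python
from concurrent.futures import ThreadPoolExecutor

def fetch(attribute_id):
    # Hypothetical stand-in for a real data-retrieval call.
    return attribute_id

attribute_ids = list(range(12))

# max_workers=3 caps the number of simultaneous requests, so the server
# is never flooded regardless of how many attributes are queued.
with ThreadPoolExecutor(max_workers=3) as pool:
    results = list(pool.map(fetch, attribute_ids))

print(len(results))  # -> 12
```

    The same cap is expressed in .NET via ParallelOptions.MaxDegreeOfParallelism; the pool size, not the total work queued, is what bounds the load the server sees at any instant.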

     

    Bulk is far and away the fastest way to retrieve data for lots of PI tags. If you cannot use bulk, then all other heroic coding efforts will either fall short on performance or cause your data archive to automatically throttle itself down because your app has greedily overwhelmed it with too many requests.

     

    Good luck.
