NickSwardh NuGet Nvidia Jetson Orin Nano™ GPU CUDA Inferencing

Posted on December 30, 2024 by devmobilenz

My Seeedstudio reComputer J3011 has two processors an ARM64 CPU and an Nividia Jetson Orin 8G coprocessor. YoloDotNet by NickSwardh V2 (uses SkiaSharp) was significantly faster when run on the ARM64 CPU so I wanted to try inferencing with the Nividia Jetson Orin 8G coprocessor.

Performance of YoloDotNet by NickSwardh V2 running on the ARM64 CPU

Performance of YoloDotNet by NickSwardh V2 running on the Nividia Jetson Orin 8G with Compute Unified Device Architecture (CUDA) enabled.

Enabling CUDA reduced the total image scaling, pre-processing, inferencing, and post processing time from 115mSec to 36mSec which is a significant improvement.

YoloDotNet NuGet on a diet – Part 1

Posted on October 30, 2024 by devmobilenz

Several of my projects use the NickSwardh/YoloDotNet NuGet which supports NVIDIA CUDA but not TensorRT. The first step before “putting the NuGet on a diet” was to fix up my test application because some of the method signatures had changed in the latest release.

// load the app settings into configuration
var configuration = new ConfigurationBuilder()
   .AddJsonFile("appsettings.json", false, true)
   .Build();

_applicationSettings = configuration.GetSection("ApplicationSettings").Get<Model.ApplicationSettings>();

Console.WriteLine($" {DateTime.UtcNow:yy-MM-dd HH:mm:ss.fff} YoloV8 Model load start : {_applicationSettings.ModelPath}");

//using (var predictor = new Yolo(_applicationSettings.ModelPath, false))
using var yolo = new Yolo(new YoloOptions()
{
   OnnxModel = _applicationSettings.ModelPath,
   Cuda = false,
   PrimeGpu = false,
   ModelType = ModelType.ObjectDetection,
});
{
   Console.WriteLine($" {DateTime.UtcNow:yy-MM-dd HH:mm:ss.fff} YoloV8 Model load done");
   Console.WriteLine();

   //using (var image = await SixLabors.ImageSharp.Image.LoadAsync<Rgba32>(_applicationSettings.ImageInputPath))
   using (var image = SKImage.FromEncodedData(_applicationSettings.ImageInputPath))
   {
      Console.WriteLine($" {DateTime.UtcNow:yy-MM-dd HH:mm:ss.fff} YoloV8 Model detect start");

      var predictions = yolo.RunObjectDetection(image);

      Console.WriteLine($" {DateTime.UtcNow:yy-MM-dd HH:mm:ss.fff} YoloV8 Model detect done");
      Console.WriteLine();

      foreach (var predicition in predictions)
      {
         Console.WriteLine($"  Class {predicition.Label.Name} {(predicition.Confidence * 100.0):f1}% X:{predicition.BoundingBox.Location.X} Y:{predicition.BoundingBox.Location.Y} Width:{predicition.BoundingBox.Width} Height:{predicition.BoundingBox.Height}");
      }
      Console.WriteLine();

      Console.WriteLine($" {DateTime.UtcNow:yy-MM-dd HH:mm:ss.fff} Plot and save : {_applicationSettings.ImageOutputPath}");

      using (SKImage skImage = image.Draw(predictions))
      {
         //await image.SaveAsJpegAsync(_applicationSettings.ImageOutputPath);
         skImage.Save(_applicationSettings.ImageOutputPath, SKEncodedImageFormat.Jpeg);
      }
   }
}

While testing I found the application worked with the sample Ultralytics YoloV8 ONNX models but failed with my Ultralytics Hub trained models.

The YoloDotNet code was looking for specific text in the model description which wasn’t present in the description of my Ultralytics Hub trained models.

I downloaded the YoloDotNet source and included the core project in my solution so I could temporarily modify the GetModelVersion method in OnnxPropertiesExtension.cs.

 /// <summary>
 /// Get ONNX model version
 /// </summary>
 private static ModelVersion GetModelVersion(string modelDescription) => modelDescription.ToLower() switch
 {
     var version when version.Contains("yolo") is false => ModelVersion.V8,
    var version when version.Contains("yoloV8") is false => ModelVersion.V8, // <========
    var version when version.StartsWith("ultralytics yolov8") => ModelVersion.V8,
     var version when version.StartsWith("ultralytics yolov9") => ModelVersion.V9,
     var version when version.StartsWith("ultralytics yolov10") => ModelVersion.V10,
     var version when version.StartsWith("ultralytics yolo11") => ModelVersion.V11, // Note the missing v in Yolo11
     var version when version.Contains("worldv2") => ModelVersion.V11,
     _ => throw new NotSupportedException("Onnx model not supported!")
 };

After getting the test application running in the Visual Studio 2022 debugger it looked like the CustomMetadata Version info would be a better choice.

To check my assumption, I inspected some of the sample ONNX Model properties with Netron.

It looks like the CustomMetadata Version info increments but doesn’t nicely map to the Ultralytics Yolo version.

YoloV8-NuGet Performance ARM64 CPU

Posted on September 30, 2024 by devmobilenz

To see how the dme-compunet, updated YoloDotNet and sstainba NuGets performed on an ARM64 CPU I built a test rig for the different NuGets using standard images and ONNX Models.

I started with the dme-compunet YoloV8 NuGet which found all the tennis balls and the results were consistent with earlier tests.

The YoloDotNet by NickSwardh NuGet update had some “breaking changes” so I built “old” and “updated” test harnesses.

The YoloDotNet by NickSwardh V1 and V2 results were slightly different. The V2 NuGet uses SkiaSharp which appears to significantly improve the performance.

Even though the YoloV8 by sstainba NuGet hadn’t been updated I ran the test harness just in case

The dme-compunet YoloV8 and NickSwardh YoloDotNet V1 versions produced the same results, but the NickSwardh YoloDotNet V2 results were slightly different.

dme-Compunet 291 mSec
NickSwardV1 480 mSec
NickSwardV2 115 mSecs
SStainba 422 mSec

Like in the YoloV8-NuGet Performance X64 CPU post the NickSwardV2 implementation which uses SkiaSharp was significantly faster so it looks like Sixlabors.ImageSharp is the issue.

To support Compute Unified Device Architecture (CUDA) or TensorRT inferencing with NickSwardV2(for SkiaSharp) will need some major modifications to the code so it might be better to build my own YoloV8 Nuget.

devMobile's blog

Random wanderings through Microsoft Azure esp. PaaS plumbing, the IoT bits, AI on Micro controllers, AI on Edge Devices, .NET nanoFramework, .NET Core on *nix and ML.NET+ONNX

Tag Archives: SkiaSharp

NickSwardh NuGet Nvidia Jetson Orin Nano™ GPU CUDA Inferencing

YoloDotNet NuGet on a diet – Part 1

YoloV8-NuGet Performance ARM64 CPU