ModalAI Forum
    • Categories
    • Recent
    • Tags
    • Popular
    • Users
    • Groups
    • Register
    • Login
    1. Home
    2. Rowan Dempster
    • Profile
    • Following 0
    • Followers 0
    • Topics 30
    • Posts 81
    • Best 0
    • Controversial 0
    • Groups 0

    Rowan Dempster

    @Rowan Dempster

    0
    Reputation
    15
    Profile views
    81
    Posts
    0
    Followers
    0
    Following
    Joined Last Online

    Rowan Dempster Unfollow Follow

    Latest posts made by Rowan Dempster

    • Migrating from QVIO to OpenVINS (SDK1.6)

      Hi Modal,

      Cleo is in the evaluation phase of switching from QVIO to OpenVINS (specifically version https://gitlab.com/voxl-public/voxl-sdk/services/voxl-open-vins-server/-/tags/v0.5.7). As part of this process, I would like to gather the latest information from the VOXL-OpenVINS developers regarding what to expect in terms of relative performance moving from QVIO to OpenVINS. Additionally, are there any platform/airframe specific tuning routines (besides of course cam<->imu extrinsics and cam intrinsics)? We have a very different airframe than the starling 🙂 https://cleorobotics.com/specifications/

      Some general areas that come to mind that we didn't touch on QVIO but might be relevant to OpenVINS:

      • IMU static intrinsics (calibrated once)
      • IMU dynamic calibration (ie temperature based)
      • IMU noise tuning
      • IMU<->Camera time synchronization
      • Synchronization of 3 camera frames (we have 3 non-overlapping tracking cameras)
      • The parameters in the different https://gitlab.com/voxl-public/voxl-sdk/services/voxl-open-vins-server/-/tree/master/misc_files/usr/share/modalai/voxl-open-vins/VoxlConfig?ref_type=heads files, which I noticed were not the same.

      I'm sure I'm forgetting other areas that could affect OpenVINS performance, I'm by no means a CV or filter expert 🙂 Overall, we want our migration to be as seamless as possible and result in the best performance of OpenVINS as possible on our airframe with our sensor suite. Your assistance in this is very much appreciated!

      posted in GPS-denied Navigation (VIO)
      Rowan DempsterR
      Rowan Dempster
    • RE: Minimizing voxl-camera-server CPU usage in SDK1.6

      @Alex-Kushleyev Hi Alex, I appreciate you looking into our use case for us (as always!). Please let me know if I can help by providing more details regarding the structure of our clients that are consuming the camera pipes. If there is a detail Cleo can't share publicly on the forum I can always email you as well or hop on a quick call to elaborate.

      Understood about the misp pipes, that expensive read access would explain both point #1 and the strange part of point #2 in my original post.

      We at Cleo will be monitoring this closely, since CPU usage regressions is pretty much the only gating item for us upgrading our robotic perception stack to SDK1.6. The misp_norm pipes are a great benefit to that perception stack and we'd of course love to take advantage of them as soon as possible. Thus we are definitely open to trying out zero-copy client approaches for keeping CPU usage down, or any other optimizations you could share examples of for us to try out on our use case.

      posted in Video and Image Sensors
      Rowan DempsterR
      Rowan Dempster
    • Minimizing voxl-camera-server CPU usage in SDK1.6

      Hi Modal,

      As we at Cleo update voxl-camera-server to SDK 1.6 (from SDK 1.0 with lots of backported changes), I've done some preliminary CPU profiling and have some questions on how to keep voxl-camera-server CPU usage down in SDK 1.6. Two things stand out to me:

      1. Using the new tracking_misp_norm pipe for our tracking cameras uses ~25% more of a CPU core than using the tracking pipe. 25% is total change across our 3 tracking cameras. So the usage goes from ~85% of a core to ~110% of a core (exact numbers change based on core allocation). To illustrate this you can run voxl-inspect-cam top bottom back hires_small_color tof_depth, record the CPU usage of voxl-camera-server, and then run voxl-inspect-cam top_misp_norm bottom_misp_norm back_misp_norm hires_small_color tof_depth and record the CPU usage again and compare. Is there a way to prevent this from happening, or can you explain why the tracking_misp_norm pipes use more CPU?

      2. Adding additional clients to the camera pipe topics causes voxl-camera-server to use more CPU. To illustrate this I only run voxl-camera-server (no other services), then inspect our baseline pipes (e.g. voxl-inspect-cam top_misp_norm bottom_misp_norm back_misp_norm hires_small_color tof_depth) and then in a new terminal(s) I open new clients again using voxl-inspect-cam. For example, when I open 4 new terminals (on top of the baseline terminal) and run voxl-inspect-cam top_misp_norm bottom_misp_norm back_misp_norm in each, then the voxl-camera-server CPU usage increases ~70% (from ~110% of a core, to ~180% of a core). This behavior on its own doesn't quite make sense to me, why would additional clients change the CPU load of the server, and by that much? The images should only be computed once, not per client? Furthermore, what's extra strange is that which tracking pipe is used also matters. If the baseline terminal is instead running voxl-inspect-cam top bottom back hires_small_color tof_depth and the 4 new client terminals are running voxl-inspect-cam top bottom back, then the CPU usage increase only by ~7% (from ~85% to ~92%). The different behavior between type of tracking pipe doesn't make sense to me, could you explain it?

      Overall, at Cleo we are looking to use the tracking_misp_norm pipes going forward, and be able to have multiple clients consuming those pipes with having to worry about increasing CPU usage of the voxl-camera-server. If you could comment on whether these asks are possible, or explain why not (either at this time, or never), that would help us a great deal in our process of taking advantage of Modal's great work in robotic perception!

      posted in Video and Image Sensors
      Rowan DempsterR
      Rowan Dempster
    • Optimizing DSP Load wr.t. IO

      Hi Modal,

      When upping the IMU_GYRO_RATEMAX (for lower end-to-end worse case latency in the gyro movement -> actuation loop), I'm seeing the DSP load jump (measured by px4-listener cpuload). After digging into it, it looks like the main culprit is not the processing modules, but rather the FIFO read (transfer((uint8_t *)&buffer, (uint8_t *)&buffer, transfer_size) in the ICM42688P driver) and the UART write (uart_write(&data, sizeof(DataPacket)) != sizeof(DataPacket)) to our STM board (for actuation).

      About 40% of the DSP is being used for the FIFO transfer and the UART write with IMU_GYRO_RATEMAX set to 1kHz (1000Hz).

      Any recommendations on how to minimize the DSP being choked by these IO operations?

      I can only think of marginal gains like decreasing how many bytes the FIFO is configured to contain, or optimizing our DataPacket size down to like 12 bytes (currently 17 bytes). But maybe there is some more fundamental strategy that I'm missing? Or is IO just generally a bad time for the DSP?

      Thanks for any advice!

      Rowan (Cleo Robotics)

      posted in VOXL 2
      Rowan DempsterR
      Rowan Dempster
    • RE: Recording RoyaleRecordingFile format from the ToF

      Hey guys any words of advice here?

      posted in Image Sensors
      Rowan DempsterR
      Rowan Dempster
    • Recording RoyaleRecordingFile format from the ToF

      Hi ModalAI,

      We at Cleo have been corresponding with PMD (the ToF supplier) about how to tune the ToF for optimal tradeoff between false positives (floating artifacts) and false negatives (not seeing dark surfaces). One of the main action items from our discussion (which I had ChatGPT summarize below for your reference) is the need to record what PMD calls the "RoyaleRecordingFile" format from the ToF. This will unlock the ability to tune processing parameters on a dataset of pre-recorded files, which makes large scale tuning feasible. Has ModalAI looked into producing the "RoyaleRecordingFile" format from the ToF using the VOXL2?

      Thank you,

      Rowan

      Summary of Email Correspondence on Recording RRF Files on VOXL2

      The core discussion revolves around whether it is feasible to record RRF (Royale Recording Format) files on the VOXL2 platform, which is used by Cleo Robotics in conjunction with a 3D Time-of-Flight (ToF) sensor.


      1. Motivation: Reducing False Positives and False Negatives

      Cleo Robotics has observed conflicting effects from tuning software parameters:

      • Increasing exposure time and adjusting detection thresholds sometimes improves detection of dark surfaces but introduces false positives (e.g., floating points or noise on edges).
      • Conversely, stricter thresholds reduce false positives but lead to false negatives, especially with low-reflectivity (dark) objects.

      This trade-off has motivated the team to find an optimal set of processing parameters that minimizes both FP and FN. To do this effectively, they are seeking a way to record RRF files, which would enable offline experimentation and replay with different tuning values.


      2. VOXL2 Limitations and Inquiry to PMD

      Rowan Dempster reached out to PMD Technologies to:

      • Ask whether RRF files can be recorded directly from the 3D ToF sensor on VOXL2.
      • Understand the value of producing RRF files and whether recorded data could be “played back” with alternative parameters for comparative analysis.
      • Explore potential development board solutions for this purpose.

      3. PMD’s Feedback and ModalAI Software Potential

      Tim Werner from PMD explained that:

      • The VOXL2 platform does not natively support RRF recording.
      • However, the underlying software stack used by ModalAI includes this capability through CAMX, suggesting it is technically feasible.
      • Rowan was encouraged to contact ModalAI to request access or support for enabling this feature.
      • PMD expressed willingness to assist ModalAI if needed.

      4. Cleo’s Proposed Workaround and Implementation Details

      Rowan noted:

      • They are considering using a separate development board (if available) that supports RRF recording for tuning.
      • Once optimal parameters are found, the same could be transferred to the VOXL2 by modifying the 3D ToF driver (e.g., setSpectreParams in VOXL SDK).
      • This avoids the repeated burden of recompiling and redeploying firmware for each test case.

      Rowan also provided a Google Drive link and GitLab reference to their current tuning approach for PMD’s review.


      5. PMD’s Supplemental Suggestions

      In addition to the ModalAI pathway, PMD suggested:

      • Cleo could share their current parameter values.
      • PMD might then provide tuning suggestions to improve detection performance, although this would be less precise than replaying real RRF data.

      Conclusion

      • Recording RRF files on VOXL2 is not currently supported, but may be possible with cooperation from ModalAI.
      • Cleo’s goal is to optimize ToF sensor parameters to reduce both false positives and false negatives, particularly for dark surfaces.
      • Next steps involve contacting ModalAI to explore unlocking RRF functionality, with potential support from PMD.
      • In parallel, Cleo can continue parameter tuning directly on VOXL2 through driver-level code adjustments, albeit with more effort.

      If helpful, we’d be happy to provide a concise technical summary of our current parameter usage or collaborate on getting the RRF feature exposed.

      posted in Image Sensors
      Rowan DempsterR
      Rowan Dempster
    • RE: Camera server: Preview stream stops when Large video activated

      @Alex-Kushleyev Gotcha, thanks for the info Alex and your help resolving this issue!

      posted in Video and Image Sensors
      Rowan DempsterR
      Rowan Dempster
    • RE: Camera server: Preview stream stops when Large video activated

      @Alex-Kushleyev

      So, please try the code in the branch (with frame id fix) with different roi sources and let me know how it behaves. It should not matter whether the camera is IMX214 or IMX412, but i have not tried with IMX214. If this branch has the issue with IMX214 based on your testing, i can try it.

      Is there any reason to use the preview source? I would prefer to leave that stream disabled since we do not need it for anything except (in the past) the ROI pipe (but now that ROI works on the large video source), and we are already using the large video pipe for recording videos.

      Also, if you have not already done so on your branch, please lock voxl-camera-server to use only upper 4 cores, as it significantly improves camera server performance when the load is high (the low level ISP interrupts happen on lower cores only, so this way camera server is not interfering). The reason why i mention this is that you are using an old branch of camera server, so just double checking.

      Yes we have the voxl-camera-server locked to 4-7:

      [Unit]
      Description=voxl-camera-server
      SourcePath=/usr/bin/voxl-camera-server
      After=voxl-wait-for-fs.service
      Requires=voxl-wait-for-fs.service
      
      [Service]
      User=root
      Type=simple
      PIDFile=/run/voxl-camera-server.pid
      ExecStart=/usr/bin/taskset -c 4-7 /usr/bin/voxl-camera-server
      
      posted in Video and Image Sensors
      Rowan DempsterR
      Rowan Dempster
    • RE: Camera server: Preview stream stops when Large video activated

      @Alex-Kushleyev Looking at that branch you sent, I might see why you are having better luck than us. We at Cleo only have the code up to and including add roi transition filter to control command; re-enable publishing of preview...

      In green are the commits we have in our codebase, in red are the commits we do not have.

      b057d9e8-9955-4086-ab9a-57830d65a7c9-image.png

      I note that in the commits we do not have you added the option to change the ROI pipe source:

          enum ROI_SOURCE_STREAM {
              ROI_SOURCE_PREVIEW,
              ROI_SOURCE_VIDEO_SMALL,
              ROI_SOURCE_VIDEO_LARGE
          };
      

      and default it to

              roi_source_stream    = ROI_SOURCE_VIDEO_LARGE; //ROI_SOURCE_PREVIEW; //ROI_SOURCE_VIDEO_SMALL // ROI_SOURCE_VIDEO_LARGE
      

      Perhaps in the code you tried yesterday the ROI source is the large encoded stream, whereas with my version of the code the ROI source is the preview stream. I will try to port over those five most recent commits from your work on ROI that we do not have in our code base and hopefully get the ROI pipe to work well using the large encoded stream, and that will serve as a solution (or rather no longer have the need) to get the preview stream working alongside the large encoded stream. Thank you for your help!!

      posted in Video and Image Sensors
      Rowan DempsterR
      Rowan Dempster
    • RE: Camera server: Preview stream stops when Large video activated

      @Alex-Kushleyev Hi Alex,

      I am assuming that you are using non-raw preview stream (YUV) at resolution around 4000x3000 and similar resolution for the large encoded stream. Then the ROI stream is the downscaled version of the preview stream (please correct me if I am wrong).

      That is correct yes.

      I just tested this setup using IMX412 camera, and using the following branch (which I believe you used as a starting point):
      what camera you are using

      We are using the IMX214, not the IMX 412 - not sure if that would cause a different outcome

      Also try disabling all other cameras and seeing if the issue goes away.

      Only running the hires camera -> the issue persists.

      voxl-camera-server.conf config section for that camera (assuming other cameras do not interfere)

      Other cameras do not seem to impact results. Here is the config entry

      	"cameras":	[{
      			"type":	"imx214",
      			"name":	"hires",
      			"enabled":	true,
      			"camera_id":	0,
      			"fps":	25,
      			"en_preview":	true,
      			"preview_width":	4208,
      			"preview_height":	3120,
      			"en_raw_preview":	false,
      			"en_small_video":	true,
      			"small_video_width":	1024,
      			"small_video_height":	768,
      			"small_venc_mode":	"h265",
      			"small_venc_br_ctrl":	"cqp",
      			"small_venc_Qfixed":	30,
      			"small_venc_Qmin":	15,
      			"small_venc_Qmax":	40,
      			"small_venc_nPframes":	9,
      			"small_venc_mbps":	2,
      			"en_large_video":	true,
      			"large_video_width":	4208,
      			"large_video_height":	3120,
      			"large_venc_mode":	"h265",
      			"large_venc_br_ctrl":	"cqp",
      			"large_venc_Qfixed":	42,
      			"large_venc_Qmin":	17,
      			"large_venc_Qmax":	52,
      			"large_venc_nPframes":	29,
      			"large_venc_mbps":	30,
      			"en_snapshot":	false,
      			"en_snapshot_width":	1024,
      			"en_snapshot_height":	768,
      			"en_roi":	true,
      			"roi_width":	1152,
      			"roi_height":	720,
      			"ae_mode":	"isp"
      		}]
      

      https://gitlab.com/voxl-public/voxl-sdk/services/voxl-camera-server/-/tree/feature-add-c2d-select-roi-src/

      I will try out this code now!

      posted in Video and Image Sensors
      Rowan DempsterR
      Rowan Dempster