Can anyone recommend a Tflite Colab Notebook for VOXL2 Training
-
Thanks for an informative response. One thing that's confusing me is the message
"ERROR in pipe_client_init_channel"
as thepipe_client_init_channel
method is deprecated. Do you mind letting me know what version of the SDK you're on? If it isn't the most recent SDK, it's probably worth upgrading to see if it fixes anything. I know we've put out a lot of changes inlibmodal-pipe
. You can read how to flash the latest SDK here.Unfortunately just from these debug messages I can't pin down the issue and so I might need you to provide me with a model file to help out more. I can understand if you don't want to leak your trained model file, though. One thing you could do in this case would be to just train for a single epoch just as a means of creating a model through the same process. If I have a model file I can do some more rigorous debugging to determine the issue.
Thanks and sorry about all of this!
Thomas Patton
thomas.patton@modalai.com -
@Thomas-Patton
voxl2:/$ voxl-version
system-image: 1.6.2-M0054-14.1a-perf
kernel: #1 SMP PREEMPT Fri May 19 22:19:33 UTC 2023 4.19.125hw version: M0054
voxl-suite: 1.0.0
will update to 1.0.1
Can I email you my tflite and saved model for review? I'm doing a run right now that should be completed in a couple hours.
Sabri -
@sansoy You should upgrade to the latest SDK (1.1.2)
-
@tom so i downloaded the upgrade and started the upgrade but its been stuck for about an hour.
How long does it take to flash the upgrade?
SabriFlashing the following System Image:
Build Name: 1.7.1-M0054-14.1a-perf-nightly-20231025
Build Date: 2023-10-25
Platform: M0054
System Image Version: 1.7.1Installing the following version of voxl-suite:
voxl-suite Version: 1.1.2Would you like to continue with SDK install?
- Yes
- No
#? yes
[ERROR] invalid option
#? 1
[INFO] adb installed
[INFO] fastboot installed
---- Starting System Image Flash ----
----./flash-system-image.sh ----
Detected OS: LinuxInstaller Version: 0.8
Image Version: 1.7.1Please power off your VOXL, connect via USB,
then power on VOXL. We will keep searching for
an ADB or Fastboot device over USB
[INFO] Found ADB device
[INFO] Rebooting to fastboot
.
[INFO] Found fastboot device
[WARNING] This system image flash is intended only for the following
platform: VOXL2 (m0054)Make sure that the device that will be flashed is correct. Flashing a device with an incorrect system image will lead the device to be stuck in fastboot.
Would you like to continue with the VOXL2 (m0054) system image flash?
- Yes
- No
#? 1
-
@sansoy It should start right away, I would power cycle your voxl2 and try again
-
@tom i did all that and still stuck. could it be whats in the warning about being stuck in fastboot?
it is the voxl2 and not the voxl2 mini.[WARNING] This system image flash is intended only for the following
platform: VOXL2 (m0054)Make sure that the device that will be flashed is correct. Flashing a device with an incorrect system image will lead the device to be stuck in fastboot.
-
@sansoy As long as you are using the voxl2 SDK and are indeed flashing voxl2 hardware then that warning can be ignored.
-
@tom hey Tom, i'm having absolutely no luck.
i've tried 3 times and it still just hangs atWould you like to continue with the VOXL2 (m0054) system image flash?
- Yes
- No
#? 1
I then followed the unbrick instructions and reinstalled everything per
https://docs.modalai.com/voxl2-unbricking/#ubuntu-hostGot the system back up and running and tried to install the latest SDK again with no luck.
It just hangs. -
UPDATE: Got it working with "sudo" for the install. normally one would get a permission errors and thought maybe that was the issue and sure enough. recommend updating your docs to
say sudo ./install.sh -
@tom so i trained on a new batch of AR15 images and got really good numbers in terms of losses and mAPs. Ran an unquantized and quantized version in voxl-tflite-server and again nothing is being recognized.
Here's a link to the tflites, and saved_models with inference results on never before seen images.
Any insight on how to make these models work in your environment would be awesomely appreciated.https://drive.google.com/drive/folders/1N1pU0jMRTb3rODSfIuETPrBf66m4ody7?usp=drive_link
-
@sansoy Interesting, sudo isn't normally required. I'm curious, what linux distro are you running?
-
@sansoy @Thomas-Patton is the ML expert here and I'll let him comment on that front
-
@tom Ubuntu 22.04.3 LTS
-
@sansoy Huh, okay, that's what I run as well.
What groups are your default user in? For example, here is mine:
~ groups ok | 10:20:36 AM tom adm dialout cdrom sudo dip plugdev lpadmin lxd sambashare docker
-
@tom eve@eve:~$ groups
eve adm cdrom sudo dip plugdev lpadmin lxd sambashare -
@sansoy Can you try adding your user to the dialout group and seeing if that fixes the issue?
sudo usermod -a -G dialout $USER
-
@tom did that and still no inference.
-
@sansoy That was for fixing the fastboot issue, unrelated
-
When I get some time today I'll try to download your .tflite models and see what's going on. The good news is that the server is at least running! It very well may just be an issue with how the tensor is being parsed.
Thomas
thomas.patton@modalai.com -
Hey, they just gave me access to the Google Drive folder. Can you confirm that the
edgetpu.tflite
in the root directory is the file you want me to try and get working?Thomas