Perhaps time to shoot this and reload as it appears the only device still going is the Mountaineer and even it has lots of errors. NOTE when I say errors I mean something caused the upload to choke, why it choked is for investigation later as it could be a ton of things, but looking at the charts memory leaks don’t seem to be a problem with any of the boards as memory usage was consistent throughout the test.
Mountaineer (running 4.3) was still going (6450 entries recorded, and 114 errors)
Cerberus made 2570 iterations with 3 errors
Hydra made 2570 iterations with 3 errors as well, perhaps a ENC28 issue?
Spider with Wifi 1361 iterations, but no errors
I had hoped that the error messages would appear on ThingSpeak as I put them in the status, but perhaps there are text limitations or something that prevented this from working. I’ll experiment with this some more as I really want those error messages and perhaps what I’ll end up doing is a SD Card based log file or something.
I’m with you Gus on this as I know we would both like this to be absolutely perfect. I think I’ll add a Spider with a J11D module to the suite of test platforms. The other thing I think I’ll do this time around is put all of these through the same router (wired went through one router and the wifi went through another) and record the syslog data from that router and see if there is any information in there that might help figure this out.
The only device still going is the Mountaineer and even it has lots of errors.[/quote]
It is good to see the diligent work of Pierre-Yves on the lwIP integration code paying off, and it is consistent with the feedback from our commercial customers: no need to reboot a device, just to get networking up again.
I’d definitely would like to learn more about the root causes of these errors (exceptions), and whether they are normal error situations as they may occur in IP networks, or not. Whether they depend on the router model, network saturation, etc. We had comparable tests, over longer periods of time, without such errors. To be able to reproduce them would be great.
Thanks for the testing effort, highly appreciated!
Now that I’ve seen some errors, time to figure out what they are, hence capturing the details and router logs etc. I think a lot of people falsely assume the internet just works all the time, when in fact there are so many potential points of failure that its enough to spin your head and your device. Time to find and line up all the dragons from biggest to smallest and start slaying them as networking is one of those key ‘gotta work’ features for devices.
Cuno I was impressed how the Mountaineer just kept plowing on so please pass my appreciation onto Pierre-Yves for his impressive work around networking.
What does this mean? It just went to 1361 iterations and didn’t try to upload anymore but there was no error thrown? So, there was an error throwing errors???
[/quote]That would be the million dollar question, and maybe on the next round I’ll have an answer.
Most likely your devices ran into the connect hangs forever bug. The Mountaineer is running 4.3, which has a fix for this or so I have heard. If you modify your code to run the internet connection part in a different thread and kill the thread after a timeout of several to 30 seconds, I am sure you will see it run for longer.
I had a device running 4.1 on a CANXtra that did over 150K iterations with two updates each iteration. It probably would have kept on going, but the windstorm took out power and internet.