Unfortunately, I do not have any info from you to get started with any kind of testing. Ideally, it would be a test method that (even randomly) produces an error.
Mark,
What was discovered previously?
I can’t be the only one getting these errors, can I?
My crons get these errors maybe once a week. If you read up in the thread, I’ve been posting when this happens. Generally it’s not often, but tonight there were significant failures.
Tim
I just got these -
Error: Backendless encountered an error while handling the request. An internal trouble ticket with ID bl-server:0B855C07-B804-4061-8B82-D18ADD1D8627 has been created and we will be investigating the issue.
Error: Backendless encountered an error while handling the request. An internal trouble ticket with ID bl-server:41D38F49-1E3F-468E-8393-3FC88F202E05 has been created and we will be investigating the issue.
Both errors have this as the cause -
Caused by: java.net.UnknownHostException
This seems related. Can you look at the details in the ticket?
Tim
@mark-piller @Volodymyr_Ialovyi @Inna_Shkolnaya
We are getting the same errors. They are very hard to reproduce.
They occur in many different methods, at random times.
We have a custom function we use to catch errors. Is there some specific type of log we can add there to help you identify the issue? Are there any logs on your end you can check if we provide specific timestamps?
APP ID: 4A47197B-AE30-FA84-FF56-0071F4010900
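On the question of what to log from a custom error handler: one option is to record, for every failure, a UTC timestamp, the endpoint, the Node.js error code, and any trouble-ticket ID embedded in the message, so the entries can be matched against server-side logs. Below is a minimal TypeScript sketch. `describeError`, `ErrorRecord`, and the record fields are hypothetical names, not part of any Backendless API, and the ticket pattern is only inferred from the IDs quoted earlier in this thread.

```typescript
// Hypothetical helper: the names and the record shape are assumptions,
// not a Backendless API.
interface ErrorRecord {
  timestamp: string;       // ISO 8601, always UTC
  endpoint: string;        // service path that failed
  code: string | null;     // Node.js error code such as "ENOTFOUND", if present
  message: string;         // raw error text
  ticketId: string | null; // trouble-ticket ID quoted in the error text, if any
}

// Assumed format, inferred from this thread: "bl-server:" followed by a UUID.
const TICKET_PATTERN =
  /bl-server:[0-9A-Fa-f]{8}(?:-[0-9A-Fa-f]{4}){3}-[0-9A-Fa-f]{12}/;

function describeError(
  err: unknown,
  endpoint: string,
  now: Date = new Date(),
): ErrorRecord {
  const e = (err ?? {}) as { code?: unknown; message?: unknown };
  // Fall back to String(err) when something other than an Error was thrown.
  const message = typeof e.message === "string" ? e.message : String(err);
  const ticket = message.match(TICKET_PATTERN);
  return {
    timestamp: now.toISOString(),
    endpoint,
    code: typeof e.code === "string" ? e.code : null,
    message,
    ticketId: ticket ? ticket[0] : null,
  };
}
```

Writing the returned record as one JSON line per failure (for example with `JSON.stringify`) would keep the timestamps and ticket IDs easy to copy into a support request.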
They do -
Any updates on this? I’m still getting errors -
Error: getaddrinfo ENOTFOUND bl-server
/api/services/Crons/officeinsertorders
This service gets orders from an external API, formats them, and POSTs them to another API. It is run automatically every ~5 minutes.
This has gone from an infrequent issue to something happening in multiple locations every hour.
Error: getaddrinfo ENOTFOUND bl-server
/api/services/messages/sponsorPresale
This service runs ~15 times every Tuesday to generate flyers. There is no input. It successfully generated 50 flyers and got the error above 3 times.
Is this related to the outage this morning?
Tim
I’m getting these too, along with calls failing with bad request errors, and the only thing I see in my logs is “Task execution is aborted due to timeout.” No idea how to reproduce it.
Hi @Tim_Jones, @Andreas_Marinopoulos, @Kelly_Oglesby
Unfortunately, I can’t yet report that everything has been fully fixed, but we are actively working on it and making progress. We are aware of the increased frequency of these errors and are fully focused on addressing them. We are closely monitoring the situation and working to resolve the issues as quickly as possible. Thank you for your patience, and I’ll keep you updated as we continue to work through this.
Regards,
Viktor
Thank you for the update @Viktor_Mudrevsky.
Do you have a sense for how much time this might take? Hours, days, weeks?
Thanks,
Tim
As a quick update, I am now getting “Task execution is aborted due to timeout”.
I hope this helps you find the issue.
Tim
The errors are getting more frequent. Any update?
Error: Network Error
Error: socket hang
Error: connect ECONNREFUSED 10.42.4.75:9000
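Until the underlying cause is fixed, a caller can soften errors like these by retrying only the failures that look transient. The following is a minimal TypeScript sketch of retry with exponential backoff, not a fix for the server-side problem. `withRetry`, `isTransient`, the default attempt count, and the delays are all assumptions chosen for illustration; the codes in the list are standard Node.js error codes that appear in this thread.

```typescript
// Node.js error codes seen in this thread that usually point to a
// transient network failure rather than a bug in the calling code.
const TRANSIENT_CODES = new Set(["ENOTFOUND", "ECONNREFUSED", "ECONNRESET"]);

// Hypothetical helper: true when the thrown value carries a transient code.
function isTransient(err: unknown): boolean {
  const code = (err as { code?: unknown } | null)?.code;
  return typeof code === "string" && TRANSIENT_CODES.has(code);
}

// Hypothetical helper: runs `task`, retrying transient failures only.
// maxAttempts and baseDelayMs are illustrative defaults, not recommendations.
async function withRetry<T>(
  task: () => Promise<T>,
  maxAttempts = 3,
  baseDelayMs = 500,
): Promise<T> {
  for (let attempt = 1; ; attempt++) {
    try {
      return await task();
    } catch (err) {
      // Give up on the last attempt, or immediately for non-transient errors.
      if (attempt >= maxAttempts || !isTransient(err)) throw err;
      // Exponential backoff: baseDelayMs, then 2x, then 4x, ...
      const delayMs = baseDelayMs * 2 ** (attempt - 1);
      await new Promise<void>((resolve) => setTimeout(resolve, delayMs));
    }
  }
}
```

One caveat: a service that POSTs orders to another API should only be wrapped this way if the request is safe to repeat, since a retry after a dropped connection could otherwise submit the same order twice.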
Hi @Tim_Jones, @Andreas_Marinopoulos, @Kelly_Oglesby,
Please check how your applications are performing today. We resolved the issue yesterday where several of our worker nodes were not functioning correctly.
Regarding the specific errors discussed in this thread, unfortunately, they may still occur for now, but their frequency should be significantly reduced. We are continuing to work on eliminating these errors - much has already been researched and accomplished, but there is still more to do and verify.
Regards,
Viktor
Things are much better! I had 1-2 errors last night and another 1-2 this morning. Please keep us updated; it would be nice to put this to bed. I started this thread more than 2 years ago!
Tim
I’m still getting a few errors -
Error: Connection aborted due to timeout
Task execution is aborted due to timeout
Any update for us? Any explanation for the outage?
Tim
Hello @Tim_Jones
Sorry for the inconvenience and thank you for your patience.
We are working on a solution to this problem.
Regards,
Volodymyr
Thanks for the update @Volodymyr_Ialovyi.
I’m not sure if it is related, but last night around 8:03pm ET, I received this error message about 10 times -
Error: read ECONNRESET
Thanks,
Tim