Message boards : Number crunching : Problems and Technical Issues with Rosetta@home
Previous · 1 . . . 334 · 335 · 336 · 337
Author | Message |
---|---|
Sid Celery Send message Joined: 11 Feb 08 Posts: 2354 Credit: 44,881,799 RAC: 29,695 ![]() |
My uncredited Validate Errors have dropped to only 1 across all my PCs, but I note Tom M has suffered 15, losing ~6600 credits and his average has dropped from 115k to 106k - still 11% higher than before, but much lower than it initially jumped up to. Weirdly, I'm glad to read that. If nothing was going on with your system and these problems appeared I'd worry because you'd never know when it was going to reappear. Tasks seem to be sensitive to things like a power outage, so whatever happened there is likely to have resulted in the validation errors. Plus, I imagine, downtime during that period will have resulted in your credit loss and, yes, consistent running should boost you back up. If not, that is, for the severe lack of tasks atm - down to under 59k in progress. I see you're down to 133 tasks for your 128 threads, so you're on the brink of idle threads - not for anything you've done, just a lack of tasks for everyone. So frustrating when there are still 4.8 million tasks queued on the front page, but not made ready for downloading. ![]() ![]() |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2354 Credit: 44,881,799 RAC: 29,695 ![]() |
Sid: You made the change and you double-checked it's still there. That's good - you can't do any more than that. As one very final check, do another Rosetta update, then look at your Event log. It should look like this 27/05/2025 12:26:01 | Rosetta@home | Sending scheduler request: To fetch work. 27/05/2025 12:26:01 | Rosetta@home | Requesting new tasks for CPU 27/05/2025 12:26:09 | Rosetta@home | Scheduler request completed: got 0 new tasks 27/05/2025 12:26:09 | Rosetta@home | No tasks sent 27/05/2025 12:26:09 | Rosetta@home | Project requested delay of 31 seconds That is, you asked for tasks and you "got 0 new tasks" Because there are so few to send to anyone right now. What it would've said in the past is something like 'error in server feed' (forgot the exact wording but something to do with errors and servers) That is, it couldn't ask for tasks in the first place. That's the problem your hosts edit fixed and why you received nothing for months. That you got nothing through straight away isn't a surprise. When I check my own log I see multiple attempts and it's 'got 0 new tasks' most of the time. But over maybe 24hrs, after multiple attempts, the occasional task does slip through. That's what I'd hope you see soon. ![]() ![]() |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2354 Credit: 44,881,799 RAC: 29,695 ![]() |
But over maybe 24hrs, after multiple attempts, the occasional task does slip through. That's what I'd hope you see soon. As an example, minutes after my last post, this happened on the machine I'm posting from atm 27/05/2025 12:47:36 | Rosetta@home | Sending scheduler request: To fetch work. 27/05/2025 12:47:36 | Rosetta@home | Reporting 2 completed tasks 27/05/2025 12:47:36 | Rosetta@home | Requesting new tasks for CPU 27/05/2025 12:47:41 | Rosetta@home | Scheduler request completed: got 3 new tasks 27/05/2025 12:47:41 | Rosetta@home | Project requested delay of 31 seconds I have no idea how I keep getting sufficient tasks on all my PCs when there are so few to grab. I'm having some kind of miracle-run atm Edit: and now this. It's inexplicable tbh 27/05/2025 13:11:59 | Rosetta@home | Sending scheduler request: To fetch work. 27/05/2025 13:11:59 | Rosetta@home | Reporting 1 completed tasks 27/05/2025 13:11:59 | Rosetta@home | Requesting new tasks for CPU 27/05/2025 13:12:03 | Rosetta@home | Scheduler request completed: got 2 new tasks 27/05/2025 13:12:03 | Rosetta@home | Project requested delay of 31 seconds Edit 2: Checking further, 26 grabbed in the last 14hrs running on a 16-thread PC with 12hr runtimes. 8 of them already running and 8 more due to start in the next hour or two - I'm only running a relatively small offline cache ![]() ![]() |
Tom M Send message Joined: 20 Jun 17 Posts: 137 Credit: 30,334,776 RAC: 101,485 ![]() |
This computer is already running WCG, Einstein, Milky Way and Asteroids, in addition to Rosetta. It has been crunching them for some time without problems. I try not to let it overheat by keeping it under 190 degrees F.I haven't received any Rosetta tasks in a month or so. I believe that after you make changes to the hosts file on ANY operating system you need to restart the whole system for the changes to take effect. I know it works on both my Linux and Windows systems. Proud member of the O.F.A. (Old Farts Association) |
Jean-David Beyer Send message Joined: 2 Nov 05 Posts: 220 Credit: 7,496,415 RAC: 1,064 ![]() |
I believe that after you make changes to the hosts file on ANY operating system you need to restart the whole system for the changes to take effect. I know it works on both my Linux and Windows systems. I run Red Hat Enterprise Linux release 8.10 (Ootpa) and when I changed my /etc/hosts file, it took effect immediately. No need to restart the system or even the Boinc client. ![]() |
Tom M Send message Joined: 20 Jun 17 Posts: 137 Credit: 30,334,776 RAC: 101,485 ![]() |
I believe that after you make changes to the hosts file on ANY operating system you need to restart the whole system for the changes to take effect. I know it works on both my Linux and Windows systems. I will admit, I am basing my Linux results on when I changed system names in the hosts file. I had to either run some exotic linux command or restart the system. Based on the Windows results mentioned above I suspect windows only reads it's hosts file during startup. Respectfully, Proud member of the O.F.A. (Old Farts Association) |
Tom M Send message Joined: 20 Jun 17 Posts: 137 Credit: 30,334,776 RAC: 101,485 ![]() |
https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=6327403 My cache is down to 92 tasks. And my WCG with resource "0" is currently running about 22 tasks. I have started up my update script to see if I can suck down some more. Proud member of the O.F.A. (Old Farts Association) |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2354 Credit: 44,881,799 RAC: 29,695 ![]() |
I believe that after you make changes to the hosts file on ANY operating system you need to restart the whole system for the changes to take effect. I know it works on both my Linux and Windows systems. Yes, Windows users reported this too. I guess it's possible it may not work that way for everyone, so by all means reboot if the Event Log error message related to servers persists, but otherwise it's reportedly fine straight away. ![]() ![]() |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2354 Credit: 44,881,799 RAC: 29,695 ![]() |
https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=6327403 Ugh. This is where having a larger cache than the barest minimum helped, and longer than default 8hr (or borked 3hr) runtimes too, for a while, but it's still not sufficient. The rate your 128 threads plough through Rosetta tasks makes it hard to keep up. Meanwhile my 16-thread PC has grabbed 11 tasks in the last 40 minutes - I'm leading a charmed life atm. In progress tasks are down further to 55k. One more appeal to those still using default runtimes that turn out to only last 3hrs to explicitly set it to 8hrs in their online Rosetta settings ![]() ![]() |
![]() Send message Joined: 28 Mar 20 Posts: 1846 Credit: 18,534,891 RAC: 0 |
I believe that after you make changes to the hosts file on ANY operating system you need to restart the whole system for the changes to take effect.Nope. Although you can ipconfig /flushdns to clear the cache. Grant Darwin NT |
Tom M Send message Joined: 20 Jun 17 Posts: 137 Credit: 30,334,776 RAC: 101,485 ![]() |
https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=6327403 It is back up to 200+ So the only real question is how much loss of CPU production to run the gpu's well it effect my Rosetta rac in the long run. Proud member of the O.F.A. (Old Farts Association) |
Stevie G Send message Joined: 15 Dec 18 Posts: 122 Credit: 1,011,942 RAC: 4 |
Sid: Here's what my Event Log said: 5/28/2025 9:48:51 AM | Rosetta@home | update requested by user 5/28/2025 9:48:51 AM | Rosetta@home | Sending scheduler request: Requested by user. 5/28/2025 9:48:51 AM | Rosetta@home | Not requesting tasks: don't need (CPU: ; AMD/ATI GPU: ) 5/28/2025 9:48:52 AM | Rosetta@home | Scheduler request completed 5/28/2025 9:48:52 AM | Rosetta@home | Server error: feeder not running 5/28/2025 9:48:52 AM | Rosetta@home | Project requested delay of 3600 seconds 5/28/2025 9:51:42 AM | Milkyway@home | Starting task de_nbody_orbit_fitting_03_25_2025_v186_OCS__data__33_1740880091_2434580_2 I've re-started the system several times, the latest last night because I vacuumed dust and cat hair out. I'll be patient. Maybe some tasks will come my way. S. Gaber Oldsmar, FL |
Tom M Send message Joined: 20 Jun 17 Posts: 137 Credit: 30,334,776 RAC: 101,485 ![]() |
Sid: 5/28/2025 9:48:51 AM | Rosetta@home | Not requesting tasks: don't need (CPU: ; AMD/ATI GPU: ) Usually this means you have other projects that have filled your cache up. So you might want to No New Task all your other CPU projects until you start getting some Rosetta downloads. Proud member of the O.F.A. (Old Farts Association) |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2354 Credit: 44,881,799 RAC: 29,695 ![]() |
One more appeal to those still using default runtimes that turn out to only last 3hrs to explicitly set it to 8hrs in their online Rosetta settings 279 now - take that all day long. When running a GPU project alongside, they do require some CPU support aiui, so inevitably you'll lose some Rosetta RAC, but I have little to no experience with them. Better that someone else chips in here. What GPU project is it? (Different ones may require a different amount of support - I don't know for sure) ![]() ![]() |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2354 Credit: 44,881,799 RAC: 29,695 ![]() |
You made the change and you double-checked it's still there. Oh dear. The critical lines of your event log are the last two Server error: feeder not running <- the wording I couldn't remember, indicating it's still not pointing at a working server Project requested delay of 3600 seconds <- delaying the retry for an hour to give the server a chance to come back. Except it won't You posted your host file here and said you double-checked it's still in place, unchanged. I don't know why this has happened - you seem to have done all the right things. From those event log lines, you still aren't going to get new tasks and I'm completely out of ideas now :( Edit: Tom fairly mentioned that your system wasn't asking for new tasks. I'm not sure that explains the error message and backoff. I don't know why it doesn't think you need more if you've had none for months - that sounds like a separate issue - but even if it did ask for tasks I'm inclined to think the next two lines will come into play and prevent it. ![]() ![]() |
kotenok2000 Send message Joined: 22 Feb 11 Posts: 284 Credit: 530,487 RAC: 33 |
Is it hosts or hosts.txt? It needs to be hosts without an extension. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2354 Credit: 44,881,799 RAC: 29,695 ![]() |
Is it hosts or hosts.txt? Fair question actually - it might be as simple a thing as that ![]() ![]() |
Tom M Send message Joined: 20 Jun 17 Posts: 137 Credit: 30,334,776 RAC: 101,485 ![]() |
One more appeal to those still using default runtimes that turn out to only last 3hrs to explicitly set it to 8hrs in their online Rosetta settings I run Einstein at home as my primary GPU project. But when they are available I run gpugrid (medical/biology) tasks. Which are NVIDIA only. https://einsteinathome.org/host/13227356 In general I want to maximize my biology/medical research but have found I want my Einstein at home RAC to stay at 3M or above. I am in transition so I have just rejoined the e@h Top 50. And look like I will get back up to 3M or so. If I could afford the electric bill as well as the cost of the gpu's I could run 7 gpu's fairly easily. But that would subtract between 14 and 21 cpu threads from the cpu-only projects. Which means I would be less productive in Rosetta or WCG. So running three gpus is about right (6-9 cpu threads). . Proud member of the O.F.A. (Old Farts Association) |
Stevie G Send message Joined: 15 Dec 18 Posts: 122 Credit: 1,011,942 RAC: 4 |
Tom: You said "5/28/2025 9:48:51 AM | Rosetta@home | Not requesting tasks: don't need (CPU: ; AMD/ATI GPU: ) Usually this means you have other projects that have filled your cache up. So you might want to No New Task all your other CPU projects until you start getting some Rosetta downloads." I have paused other projects or requested no new tasks for a day or two. And still didn't get any Rosetta tasks. That's why I said I was ready to give up on Rosetta. Event log today: 5/29/2025 2:10:43 PM | Asteroids@home | project suspended by user 5/29/2025 2:10:47 PM | Einstein@Home | project suspended by user 5/29/2025 2:10:48 PM | Universe@Home | Sending scheduler request: To fetch work. 5/29/2025 2:10:48 PM | Universe@Home | Requesting new tasks for CPU 5/29/2025 2:10:49 PM | Milkyway@home | project suspended by user 5/29/2025 2:10:50 PM | Universe@Home | Scheduler request completed: got 0 new tasks 5/29/2025 2:10:50 PM | Universe@Home | Project has no tasks available 5/29/2025 2:10:50 PM | Universe@Home | Project requested delay of 11 seconds 5/29/2025 2:10:53 PM | World Community Grid | project suspended by user 5/29/2025 2:10:55 PM | Rosetta@home | Sending scheduler request: To fetch work. 5/29/2025 2:10:55 PM | Rosetta@home | Requesting new tasks for CPU and AMD/ATI GPU 5/29/2025 2:10:57 PM | Rosetta@home | Scheduler request completed: got 0 new tasks 5/29/2025 2:10:57 PM | Rosetta@home | Server error: feeder not running 5/29/2025 2:10:57 PM | Rosetta@home | Project requested delay of 3600 seconds 5/29/2025 2:10:59 PM | Rosetta@home | update requested by user 5/29/2025 2:11:02 PM | Rosetta@home | Sending scheduler request: Requested by user. 5/29/2025 2:11:02 PM | Rosetta@home | Requesting new tasks for CPU and AMD/ATI GPU 5/29/2025 2:11:03 PM | Rosetta@home | Scheduler request completed: got 0 new tasks 5/29/2025 2:11:03 PM | Rosetta@home | Server error: feeder not running 5/29/2025 2:11:03 PM | Rosetta@home | Project requested delay of 3600 seconds S. Gaber Oldsmar, FL |
Message boards :
Number crunching :
Problems and Technical Issues with Rosetta@home
©2025 University of Washington
https://www.bakerlab.org