boinc

Commit Graph

Author	SHA1	Message	Date
David Anderson	6b8a569d6d	- client/scheduler: fix a group of bugs related to the new mechanism where the client tells the scheduler which app versions its queued jobs use (this is needed, e.g., to enforce per-app or per-resource job limits). In this mechanism, the client sends an array of <app_version>s, and each <other_result> includes an index into this array. - The wrong index was being sent (client). - If an <app_version> had a non-existent app name (e.g. because that app had been deprecated) it wasn't getting put in the array, invalidating array indices Furthermore, an erroneous message was being sent to the user Fix: if parse error for <app_version>, put it in the array anyway, but with cav.app = NULL, meaning that it's a place-holder. Send a message to user only if anon platform. - manager: increase notice buffers to 64K svn path=/trunk/boinc/; revision=22052	2010-07-23 17:43:20 +00:00
David Anderson	faab0991f7	- scheduler: fix and restore fpops scaling for anonymous platform jobs svn path=/trunk/boinc/; revision=21962	2010-07-15 21:38:24 +00:00
David Anderson	55e0e86c90	- scheduler: make messages translatable svn path=/trunk/boinc/; revision=21896	2010-07-13 02:49:35 +00:00
David Anderson	e53e9710e8	- scheduler: make some "notice"-priority messages translatable - scheduler: add a clause to wu_is_infeasible_custom() for SETI@home: don't process VLAR jobs using CUDA apps. Note: this is implemented in a slightly non-optimal way. If the request asks for both GPU and CPU jobs, the scheduler will first decide to use the GPU version. It will scan jobs, skipping over VLAR jobs. When the GPU request is satisfied, it will switch to the CPU version and continue scanning, accepting VLAR jobs. But the jobs that were skipped initially won't be rescanned. Also, it would be slightly nice to preferentially send VLAR jobs to hosts asking for CPU work. (This could be done in the scoring function). svn path=/trunk/boinc/; revision=21895	2010-07-12 22:43:53 +00:00
David Anderson	b1851ce02c	- user web: PHP 5.3 compatibility fix, from Nicolas. Fixes #787 svn path=/trunk/boinc/; revision=21878	2010-07-06 23:31:26 +00:00
David Anderson	d756994bda	- scheduler and back end: message tweaks and fixes svn path=/trunk/boinc/; revision=21835	2010-06-29 03:20:19 +00:00
David Anderson	7c51512cbf	- transitioner: the format string for a DB query had %.15d instead of %.15e. That produced a messed-up query that assigned garbage values to: host_app_version.turnaround_var host_app_version.turnaround_q host_app_version.max_jobs_per_day host_app_version.consecutive_valid To repair these: - set turnaround_var and turnaround_q to zero - if max_jobs_per_day is outside of (0..config.daily_result_quota) set it to config.daily_result_quota - if consecutive_valid is outside (0..1000), set it to zero I added a script, html/ops/repair_21812.php, that does this; if you ran server code between [21181] and [21812], run this script. - scheduler/transitioner: add <debug_quota> log flag - changed the build system to always use -Wall (if we'd done this before, this bug wouldn't have happened) - fixed a bunch of other compile warnings svn path=/trunk/boinc/; revision=21812	2010-06-25 18:54:37 +00:00
David Anderson	587a4cde3f	- scheduler: msg tweaks svn path=/trunk/boinc/; revision=21805	2010-06-24 22:58:05 +00:00
David Anderson	ae7866b251	- scheduler: restore scaling of daily quota by # processors and/or config.gpu_multiplier - client: msg tweak svn path=/trunk/boinc/; revision=21753	2010-06-15 22:21:57 +00:00
David Anderson	f849faea5e	- scheduler: bug fixes for jobs-in-progress limits - client: msg tweak svn path=/trunk/boinc/; revision=21692	2010-06-04 16:57:33 +00:00
David Anderson	e80e54fd4d	- user web: add "Application info" link in host page, linking to new page showing host_app_versions for this host - scheduler: message tweaks svn path=/trunk/boinc/; revision=21690	2010-06-03 20:26:02 +00:00
David Anderson	cf7fb29227	- scheduler: add fine-grained "max jobs in progress" control. You can now specify limits for specific apps, and/or for the project as a whole. Within each of these, you can specify limits on CPU jobs, GPU jobs, or total jobs. In the case of CPU and GPU limits, you can specify whether the limit should be scaled by the number of devices. Note: the enforcement of this is done in get_app_version(), since per-resource-type limits may dictate what app versions we can use for a particular job. svn path=/trunk/boinc/; revision=21674	2010-06-01 23:41:07 +00:00
David Anderson	ca239d913a	- scheduler: fix memory leak (free BEST_APP_VERSION objects) svn path=/trunk/boinc/; revision=21597	2010-05-21 21:49:54 +00:00
David Anderson	fa66519441	- scheduler: SETI@home's CUDA and CUDA 2.3 apps apparently don't run on Fermi (compute capability 2) hardware. Temporary solution: change app_plan() accordingly - scheduler: message tweaks svn path=/trunk/boinc/; revision=21595	2010-05-20 22:49:00 +00:00
David Anderson	7a7cf4f5e7	- client, Unix: error checking in reading /proc entries. Avoid garbage values e.g. of working_set_size - scheduler: message tweaks svn path=/trunk/boinc/; revision=21591	2010-05-20 17:50:00 +00:00
David Anderson	5470d7289a	- scheduler: fix bug in daily job quota check svn path=/trunk/boinc/; revision=21506	2010-05-13 16:45:27 +00:00
David Anderson	7688a6c5d6	- scheduler: fix for daily quota enforcement svn path=/trunk/boinc/; revision=21495	2010-05-12 21:24:52 +00:00
David Anderson	63dcfabe0e	- scheduler: changeset 21148 broke the scheduler. We store pointers to BEST_APP_VERSION in both APP_VERSION and RESULT. We can't then fiddle with the vector that these point into. Switch back to using a vector of pointers. This restores the memory leak, which I'll deal with later. svn path=/trunk/boinc/; revision=21494	2010-05-12 21:07:39 +00:00
David Anderson	021edb02c2	- back end programs: improve log msgs svn path=/trunk/boinc/; revision=21193	2010-04-16 18:07:08 +00:00
David Anderson	b2451544e1	- server: change the following from per-host to per-(host, app version): - daily quota mechanism - reliable mechanism (accelerated retries) - "trusted" mechanism (adaptive replication) - scheduler: enforce host scale probation only for apps with host_scale_check set. - validator: do scale probation on invalid results (need this in addition to error and timeout cases) - feeder: update app version scales every 10 min, not 10 sec - back-end apps: support --foo as well as -foo for options Notes: - If you have, say, cuda, cuda23 and cuda_fermi plan classes, a host will have separate quotas for each one. That means it could error out on 100 jobs for cuda_fermi, and when its quota goes to zero, error out on 100 jobs for cuda23, etc. This is intentional; there may be cases where one version works but not the others. - host.error_rate and host.max_results_day are deprecated TODO: - the values in the app table for limits on jobs in progress etc. should override rather than config.xml. Implementation notes: scheduler: process_request(): read all host_app_versions for host at start; Compute "reliable" and "trusted" for each one. write modified records at end get_app_version(): add "reliable_only" arg; if set, use only reliable versions skip over-quota versions Multi-pass scheduling: if have at least one reliable version, do a pass for jobs that need reliable, and use only reliable versions. Then clear best_app_versions cache. Score-based scheduling: for need-reliable jobs, it will pick the fastest version, then give a score bonus if that version happens to be reliable. When get back a successful result from client: increase daily quota When get back an error result from client: impose scale probation decrease daily quota if not aborted Validator: when handling a WU, create a vector of HOST_APP_VERSION parallel to vector of RESULT. Pass it to assign_credit_set(). Make copies of originals so we can update only modified ones update HOST_APP_VERSION error rates Transitioner: decrease quota on timeout svn path=/trunk/boinc/; revision=21181	2010-04-15 03:13:56 +00:00
David Anderson	2e41153d8b	- scheduler: fix egregious bug which limited sending to 1 job per RPC - scheduler: fix bug that broke anon platform Note: Bruce Allen once advised me to take a few days and just observe BOINC in action. I should really do this more often; it always turns up bugs and/or design flaws. svn path=/trunk/boinc/; revision=21165	2010-04-11 04:42:52 +00:00
David Anderson	e05a479f42	- scheduler and validator: distinguish between 1) peak FLOPS (based on benchmarks or GPU attributes). This does not change over time. It's not adjusted on the basis of statistics. It's not affected by wu.rsc_fpops_est. It can be compared across projects. versus 2) projected FLOPS: the scheduler's best guess as to what will satisfy X * elapsed_time = wu.rsc_fpops_est; this is used to make server-side runtime estimates, and it's sent to the client and used for its runtime estimates. It may be based on the (host, app version) elapsed time average. My checkin [21153] mistakently confounded these two. Notes: 1) app_plan() now must return both peak and projected FLOPS. 2) result.flops_estimate stores peak FLOPS 3) the <flops> field in app_info.xml files should be projected FLOPS. But its accuracy is not important; it's not used once the server has statistics for the (host, app version) svn path=/trunk/boinc/; revision=21164	2010-04-10 05:49:51 +00:00
David Anderson	1d765245ed	- scheduler: sweeping changes to the way job runtimes are estimated: see http://boinc.berkeley.edu/trac/wiki/RuntimeEstimation svn path=/trunk/boinc/; revision=21153	2010-04-08 23:14:47 +00:00
David Anderson	85e06afe4b	- scheduler: app_plan() no longer has to guess how efficiently an app version will run on a particular host. - scheduler: fix memory leak: BEST_APP_VERSIONs weren't being freed svn path=/trunk/boinc/; revision=21148	2010-04-08 18:27:27 +00:00
David Anderson	71c7e7a74b	- client/scheduler/web: add per-project preferences for whether to accept CPU, NVIDIA and ATI jobs. These prefs are shown only where relevant: e.g., only for processor types for which the project has app versions, and if it has versions for only one type, no pref is shown. These prefs affect both client and scheduler. The client won't ask for work for a device blocked by prefs, and the scheduler won't send it. This replaces earlier optional project-specific prefs for "no CPU jobs" and "no GPU jobs". (However, these prefs continue to be honored on the server side). - client: if NVIDIA driver is unknown, say that rather than 0 svn path=/trunk/boinc/; revision=19194	2009-09-28 04:24:18 +00:00
David Anderson	eafb410cf8	- scheduler: simplify and fix the way that app_plan() conveys messages to the user. app_plan() now generates the messages directly rather than returning integer error codes. svn path=/trunk/boinc/; revision=18899	2009-08-21 20:38:39 +00:00
David Anderson	9e9f2a9878	- scheduler: code cleanup svn path=/trunk/boinc/; revision=18896	2009-08-21 19:14:15 +00:00
David Anderson	7278ab1787	- scheduler: add support for ATI GPUs svn path=/trunk/boinc/; revision=18851	2009-08-17 17:07:38 +00:00
David Anderson	b300519444	svn path=/trunk/boinc/; revision=18825	2009-08-10 04:49:02 +00:00
David Anderson	f163897d8a	- scheduler: add plan class for CUDA 2.3 svn path=/trunk/boinc/; revision=18804	2009-08-03 21:30:19 +00:00
David Anderson	e3363c7eb8	- scheduler: on second thought, it would be better to add the above feature without requiring use of score-based scheduling. So add a new customizable function, wu_is_infeasible_custom(), where projects can put job-specific checks. Also, move customizable functions (of which there are now 4) to a new file, sched_customize.cpp. svn path=/trunk/boinc/; revision=18767	2009-07-29 18:55:50 +00:00
David Anderson	3ff7a0d023	- scheduler: return better message if client has too little GPU RAM, wrong driver version, etc. (tell them what the specific requirement is) svn path=/trunk/boinc/; revision=18215	2009-05-28 16:37:26 +00:00
David Anderson	84afd18450	- scheduler: move app-version selection and score-based scheduling to new files. svn path=/trunk/boinc/; revision=17630	2009-03-19 16:35:35 +00:00

1 2

83 Commits