boinc

Commit Graph

Author	SHA1	Message	Date
David Anderson	c9c9f2bae0	- scheduler: code shuffle; new file sched_check.cpp contains functions that decide whether a job can be sent to a host	2013-04-09 12:19:00 -07:00
Bernd Machenschalk	444cc65722	make extern declaration of fpops_to_credit() match implementation	2013-03-01 16:26:08 +01:00
David Anderson	8fda6c0497	- Vbox wrapper: add --trickle x option; sends a trickle-up message reporting incremental runtime every x seconds of runtime. - client: more XML parsing cleanup - credit trickle handler: do sanity checks on CPU speed svn path=/trunk/boinc/; revision=24017	2011-08-21 11:18:08 +00:00
David Anderson	8ca24cbbab	- client, work fetch policy: adjust project REC by the amount of work queued, to increase variety NOTE: at some point I think I had a reason to not do this, but I can't remember what it is. - client, job scheduling policy: fix how project REC is adjusted svn path=/trunk/boinc/; revision=23838	2011-07-13 19:46:03 +00:00
David Anderson	732866b8aa	- back end: add two example trickle handlers: trickle_credit: grants credit based on CPU time reported in msg trickle_echo: echoes trickle-up as a trickle-down svn path=/trunk/boinc/; revision=23118	2011-02-27 00:10:14 +00:00
David Anderson	4147249de2	- server: delete old credit stuff - user web: show host link in user result list. Fixes #999 svn path=/trunk/boinc/; revision=21735	2010-06-12 22:08:15 +00:00
David Anderson	7802d3f253	- server: change credit constant from 100 to 200. Data from S@h beta (running for a month or so) suggests that this will almost exactly match the average credit granted by S@h's old system (which, in theory, is the reference for other projects) svn path=/trunk/boinc/; revision=21727	2010-06-10 18:05:42 +00:00
David Anderson	b8df52bc8a	- client: temporarily enable logic that deallocates memory on exit, so that we can look for memory leaks. - client: enable bandwidth quota limit only if both #MB and #days are nonzero. - scheduler: when resending work, don't send more than client is requesting - scheduler: restore Cobblestone factor to 100 svn path=/trunk/boinc/; revision=21460	2010-05-11 19:50:14 +00:00
David Anderson	5035007b90	- back end: new way of deciding: - whether host is "reliable" for an app version - whether host is eligible for single replication for an app version - whether to use host scaling In each case, the answer is yes if the number of consecutive valid results is above a threshold. This replaces existing "error rate" and "scale probation" mechanisms. TODO: the # of consecutive valid results should also determine a limit on jobs in progress for an app version. Namely, if N is the threshold for host scaling, the limit should be ndevices*(max(1, consecutive_valid - N)) The client currently doesn't supply enough app version info to do this. It could be approximated; that would give some protection against cherry-picking. - credit: more conservative formulas for combining claimed credit among replicas. If there are normal replicas, we use a "low average" that weights each sample by the sum of the other samples. Otherwise we use the min (not the average) of the approximate samples. NOTE: a DB update is required svn path=/trunk/boinc/; revision=21230	2010-04-21 19:33:20 +00:00
David Anderson	b2451544e1	- server: change the following from per-host to per-(host, app version): - daily quota mechanism - reliable mechanism (accelerated retries) - "trusted" mechanism (adaptive replication) - scheduler: enforce host scale probation only for apps with host_scale_check set. - validator: do scale probation on invalid results (need this in addition to error and timeout cases) - feeder: update app version scales every 10 min, not 10 sec - back-end apps: support --foo as well as -foo for options Notes: - If you have, say, cuda, cuda23 and cuda_fermi plan classes, a host will have separate quotas for each one. That means it could error out on 100 jobs for cuda_fermi, and when its quota goes to zero, error out on 100 jobs for cuda23, etc. This is intentional; there may be cases where one version works but not the others. - host.error_rate and host.max_results_day are deprecated TODO: - the values in the app table for limits on jobs in progress etc. should override rather than config.xml. Implementation notes: scheduler: process_request(): read all host_app_versions for host at start; Compute "reliable" and "trusted" for each one. write modified records at end get_app_version(): add "reliable_only" arg; if set, use only reliable versions skip over-quota versions Multi-pass scheduling: if have at least one reliable version, do a pass for jobs that need reliable, and use only reliable versions. Then clear best_app_versions cache. Score-based scheduling: for need-reliable jobs, it will pick the fastest version, then give a score bonus if that version happens to be reliable. When get back a successful result from client: increase daily quota When get back an error result from client: impose scale probation decrease daily quota if not aborted Validator: when handling a WU, create a vector of HOST_APP_VERSION parallel to vector of RESULT. Pass it to assign_credit_set(). Make copies of originals so we can update only modified ones update HOST_APP_VERSION error rates Transitioner: decrease quota on timeout svn path=/trunk/boinc/; revision=21181	2010-04-15 03:13:56 +00:00
David Anderson	1d765245ed	- scheduler: sweeping changes to the way job runtimes are estimated: see http://boinc.berkeley.edu/trac/wiki/RuntimeEstimation svn path=/trunk/boinc/; revision=21153	2010-04-08 23:14:47 +00:00
David Anderson	2536797068	- validator: remove update_credit_per_cpu_sec(). Irrelevant. TODO: remove related code - validator: update wu.canonical_credit correctly. However, this field should be deprecated. - validator: check for error return from assign_credit_set(). svn path=/trunk/boinc/; revision=21096	2010-04-05 20:03:54 +00:00
David Anderson	a2a661993b	- validator: -d 4 means -d 3 plus print all DB queries (todo: do this for all daemons) - validator: change cmdline args from -foo to --foo (todo: do this for all daemons) - validator: pass max_granted_credit to assign_credit_set() svn path=/trunk/boinc/; revision=21093	2010-04-05 18:59:16 +00:00
David Anderson	fb851311e0	- server: various changes; see http://boinc.berkeley.edu/trac/wiki/CreditNew Projects will need to update DB and recompile all back-end programs. Summary: - new way of computing credit - "reliable host" mechanism is per app version - "host punishment" mechanism is per app version - adjustment of wu.rsc_fpops_est provides the equivalent of per app version DCF - max jobs in progress is now per app - max jobs per RPC is now per app TODO: - reliable mechanism: - populate and use host_app_version.error_rate - populate host_app_version.turnaround - host punishment: - populate host_app_version.max_jobs_per_day - populate host_app_version.n_jobs_today - use app.max_jobs_per_day_init - job limits: - use app.max_jobs_in_progress, max_gpu_jobs_in_progress - use app.max_jobs_per_rpc - adjust wu.rsc_fpops_est - remove old credit stuff fpops_cumulative, credit_multiplier credit computation in scheduler - AVERAGE class: use the Knuth algorithm (Wikipedia) svn path=/trunk/boinc/; revision=21021	2010-03-29 22:28:20 +00:00
David Anderson	fb443e5c31	- compile fixes svn path=/trunk/boinc/; revision=18832	2009-08-13 03:35:26 +00:00
David Anderson	3fb7c8f13f	- server code: moved everything related to credit-granting to credit.cpp, where it can be used by trickle handlers as well as by validators. svn path=/trunk/boinc/; revision=18831	2009-08-12 16:26:43 +00:00
David Anderson	f6d3e8a477	svn path=/trunk/boinc/; revision=18829	2009-08-11 15:17:37 +00:00

17 Commits