RAM to run the job, but when we actually run the job
not enough GPU RAM is free, so the application fails.
This can cause a large number of jobs to fail.
Solution:
- app_plan() can specify the GPU RAM requirements of an app version.
This is passed to the client in a new field
<gpu_ram> of the <app_version> element.
- prior to starting or restarting a GPU app, the client
checks the amount of free RAM on the particular GPU.
If it's not enough for the app version,
the client doesn't start it,
and arranges for the scheduler to ignore it for 5 minutes
(by which point there might be more free GPU RAM)
Notes:
1) this change will take effect only when
both the client and the scheduler are updated.
2) the check is done in enforce_schedule(),
rather than schedule_cpus(),
because only at that point
have we assigned a specific GPU to the job.
3) there's another case to deal with:
a GPU app's malloc of GPU RAM fails in the middle of the job.
Currently the job fails.
I plan to add an API call boinc_temporary_exit(x) so
that the job can exit and potentially restart in x seconds.
(In principle this mechanism is sufficient for all cases,
but it could lead to a lot of starting/exiting,
so the current change is worthwhile).
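A minimal sketch of the client-side check described above, using illustrative types and names rather than the actual BOINC structures:

    // Illustrative only: the real check lives in enforce_schedule() and
    // uses the client's own app-version and coproc structures.
    struct AppVersion { double gpu_ram; };   // from <gpu_ram> in <app_version>
    struct Gpu { double available_ram; };    // free RAM on the GPU assigned to the job

    // Return true if the job fits on its assigned GPU.  If not, the client
    // would skip the job and have the scheduler ignore it for ~5 minutes.
    bool gpu_ram_ok(const AppVersion& av, const Gpu& gpu) {
        if (av.gpu_ram <= 0) return true;    // app version specifies no requirement
        return gpu.available_ram >= av.gpu_ram;
    }

On the app side, the planned boinc_temporary_exit(x) call would let a job whose GPU malloc fails exit and retry after x seconds instead of failing outright.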
svn path=/trunk/boinc/; revision=19864
can increase or decrease at N times real time.
My checkin of 7 Dec reflects this by changing
the STD limits to +- N*MAX_STD.
This looks like a bug to users.
Instead, scale the rate of STD change by 1/N
and keep the old limits of +- MAX_STD.
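A minimal sketch of this scaling, with illustrative names (MAX_STD taken as one day, the limit mentioned elsewhere in this log):

    #include <algorithm>

    const double MAX_STD = 86400;   // one day

    // delta is the STD change that would apply at real-time speed;
    // n is the simulation's speedup factor N.
    double update_std(double std, double delta, double n) {
        std += delta / n;                                    // scale the rate by 1/N
        return std::min(MAX_STD, std::max(-MAX_STD, std));   // keep the +- MAX_STD limits
    }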
svn path=/trunk/boinc/; revision=19851
Old: if a project has RR sim deadline misses,
select jobs to run high-priority on the basis of:
1) deadline (earliest first)
2) estimated time to completion (least first)
This ignores whether jobs missed their deadline in RR sim,
so it may choose to run a job that's actually in no
danger of missing its deadline over one that is.
New: choose only jobs that miss their deadline in RR sim
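A minimal sketch of the new rule, with illustrative types (not the actual client structures):

    #include <algorithm>
    #include <vector>

    struct Job {
        double deadline;
        bool misses_deadline_in_rr_sim;
    };

    // Pick jobs to run high-priority: only those that miss their deadline
    // in the RR simulation, earliest deadline first.
    std::vector<Job> high_priority_jobs(const std::vector<Job>& jobs) {
        std::vector<Job> out;
        for (const Job& j : jobs) {
            if (j.misses_deadline_in_rr_sim) out.push_back(j);
        }
        std::sort(out.begin(), out.end(),
                  [](const Job& a, const Job& b) { return a.deadline < b.deadline; });
        return out;
    }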
svn path=/trunk/boinc/; revision=19826
Let them float around with other projects.
Fixes a problem where, when a project finishes its last job
and has a negative STD, it gets an unfair increment
by being set to zero.
svn path=/trunk/boinc/; revision=19804
It computed an "overall STD" as the sum of CPU and coprocs,
weighted by the coproc's speed, as we do for LTD.
This was the wrong idea; in the presence of GPUs,
STDs quickly get pushed to +- 1 day and are truncated there.
New scheme: STD is maintained per (resource type, project).
This fixes the above problem,
and it opens the door to round-robin scheduling of GPUs
(see the sketch after this list).
- client: the calculation of "anticipated debt" was scaling
by relative resource share.
This wasn't correct, it seems to me.
- client: rename "debt" to "long_term_debt" in a few places
(but not in the client state file, for compatibility)
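A minimal sketch of the per-(resource type, project) bookkeeping described above, with illustrative names:

    #include <map>

    enum ResourceType { RSC_CPU, RSC_NVIDIA_GPU };

    struct ProjectDebts {
        // one short-term debt per resource type, instead of a single
        // "overall STD" summed over CPU and coprocs
        std::map<ResourceType, double> short_term_debt;
        double long_term_debt;   // LTD handling unchanged here
    };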
svn path=/trunk/boinc/; revision=19777
only if the offset is positive.
- client: some cmdline args set members of config.
However, config was being cleared after cmdline args were parsed,
so these args had no effect.
Instead, clear config before parsing the cmdline args.
svn path=/trunk/boinc/; revision=19776
Old: it's based entirely on CPU time.
So a GPU project, whose app uses only a fraction
of a CPU, accrues positive debt.
This is OK if the project has only GPU apps,
since STD is not (currently) used for GPU scheduling.
But some projects have both CPU and GPU apps.
New: STD is based on total processing.
It has terms for each resource type.
The notion of "runnable resource share" is specific to a type.
Note: the notion of "resource share fraction" appears in
a couple of other places:
- it's passed to apps in app_init_data.xml
- it's passed in scheduler requests.
It should be broken down by resource type in these cases too.
Note to self: do this later.
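A minimal sketch of a per-resource STD update along these lines (illustrative names; the real accounting lives in the client's work-fetch code):

    struct RscProjectStd {
        double secs_this_interval;   // processing this project did on this resource
        double short_term_debt;
    };

    // total_secs: processing done on this resource by all projects in the interval
    // runnable_share: this project's resource share fraction among projects
    //                 currently runnable for this resource type
    void update_std(RscProjectStd& rp, double total_secs, double runnable_share) {
        double expected = total_secs * runnable_share;
        rp.short_term_debt += expected - rp.secs_this_interval;
        rp.secs_this_interval = 0;
    }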
svn path=/trunk/boinc/; revision=19762
don't accumulate debt for that resource.
Otherwise we'll accumulate debt forever,
pushing other projects into an overworked state.
svn path=/trunk/boinc/; revision=19547
(estimated throughput of all GPUs)/(estimated throughput of all CPUs)
rather than the ratio of 1 GPU to 1 CPU.
This change will hopefully cause ratios of granted credit
to more closely match resource shares.
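A minimal sketch of this weighting, with illustrative names:

    // weight of the GPU resource type relative to CPUs:
    // throughput of all GPUs over throughput of all CPUs,
    // not the ratio of a single GPU to a single CPU
    double gpu_weight(int ngpus, double gpu_flops_each,
                      int ncpus, double cpu_flops_each) {
        return (ngpus * gpu_flops_each) / (ncpus * cpu_flops_each);
    }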
svn path=/trunk/boinc/; revision=19311
Make them both peak FLOPS,
according to the formula supplied by the manufacturer.
The impact on the client is minor:
- the startup message describing the GPU
- the weight of the resource type in computing long-term debt
On the server, I changed the example app_plan() function
to assume that app FLOPS is 20% of peak FLOPS
(that's about what it is for SETI@home)
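A minimal sketch of that server-side estimate (illustrative names, not the actual scheduler interface):

    struct GpuInfo { double peak_flops; };   // peak FLOPS per the manufacturer's formula

    // example app_plan() assumption: the app achieves 20% of peak
    double estimated_app_flops(const GpuInfo& gpu) {
        return 0.2 * gpu.peak_flops;
    }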
svn path=/trunk/boinc/; revision=19310
set a "coproc_missing" flag rather than aborting the job.
If the user removes a GPU board while there's a large queue of GPU jobs,
they'll stay queued (until their deadline passes).
Note: this doesn't fix the situation where user connects via
Remote Desktop while GPU jobs are running or queued.
We should check for Remote Desktop every minute or so, and stop GPU jobs.
svn path=/trunk/boinc/; revision=19287
prefs and update, the change wouldn't take effect until client restart.
- client: fix bug in enforcement of "no CPU/NVIDIA/ATI" prefs
svn path=/trunk/boinc/; revision=19236
to accept CPU, NVIDIA and ATI jobs.
These prefs are shown only where relevant:
e.g., only for processor types for which the project has app versions,
and if it has versions for only one type, no pref is shown.
These prefs affect both client and scheduler.
The client won't ask for work for a device blocked by prefs,
and the scheduler won't send it.
This replaces earlier optional project-specific prefs for
"no CPU jobs" and "no GPU jobs".
(However, these prefs continue to be honored on the server side).
- client: if the NVIDIA driver version is unknown, say so rather than showing 0
svn path=/trunk/boinc/; revision=19194
and <ati_backoff> elements to scheduler reply.
These specify backoffs for the resource types,
overriding the existing backoff mechanism.
Projects can supply these if they don't have apps of a particular type
and don't want to get periodic requests for them.
svn path=/trunk/boinc/; revision=19059
start only enough jobs to fill CPUs per project,
not all the CPU jobs at once.
I'm not sure how much difference this makes,
but this is how it's supposed to work.
- client: if app_info.xml doesn't specify flops,
use an estimate that takes GPUs into account.
- client: if it's been more than 2 weeks since time stats update,
don't decay on_frac at all.
svn path=/trunk/boinc/; revision=19035
If you have 2 CPUs and a 1-day job in EDF mode,
the busy time should be zero, not .5 days.
Add a class BUSY_TIME_ESTIMATOR that makes a somewhat better
(though still fairly crude) estimate.
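A minimal sketch of the idea (not the actual class): spread job durations over instances and report how long a new job would wait for a free instance. With 2 CPUs and one 1-day job this gives zero, as described above.

    #include <algorithm>
    #include <vector>

    struct BusyTimeEstimator {
        std::vector<double> instance_busy;   // accumulated busy time per instance

        explicit BusyTimeEstimator(int ninstances) : instance_busy(ninstances, 0) {}

        // add a job using nused instances for 'duration' seconds,
        // placing it on the least-loaded instances
        void add_job(double duration, int nused) {
            std::sort(instance_busy.begin(), instance_busy.end());
            for (int i = 0; i < nused && i < (int)instance_busy.size(); i++) {
                instance_busy[i] += duration;
            }
        }

        // a new single-instance job waits for the least-loaded instance
        double busy_time() const {
            return *std::min_element(instance_busy.begin(), instance_busy.end());
        }
    };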
svn path=/trunk/boinc/; revision=19003
with a GPU request if the project is anonymous platform
AND it has an app for that GPU type
- client: report overall work request as well as per-resource-type requests
svn path=/trunk/boinc/; revision=18994
to the max of the requests for different resource types.
Otherwise projects with old schedulers won't send us work.
svn path=/trunk/boinc/; revision=18945
2 * max(ncpus, ngpus);
show this in the state displayed by <work_fetch_debug>
- manager: show project-wide backoff in transfers tab
svn path=/trunk/boinc/; revision=18662
We need to estimate 2 different delays for each resource type:
1) "saturated time": the time the resource will be fully utilized
(new name for the old "estimated delay").
This is used to compute work requests.
2) "busy time": the time a new job would have to wait
to start using this resource.
This is passed to the scheduler and used for a crude deadline check.
Note: this is ill-defined; a single number doesn't suffice.
But as a very rough estimate, I'll use the sum of
(J.duration * J.ninstances)/ninstances
over all jobs that miss their deadline under RR sim.
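A minimal sketch of that rough estimate, with illustrative types:

    #include <vector>

    struct SimJob {
        double duration;        // estimated remaining run time
        int ninstances;         // instances of this resource the job uses
        bool misses_deadline;   // projected to miss its deadline under RR sim
    };

    // busy time ~= sum of (J.duration * J.ninstances)/ninstances
    // over jobs that miss their deadline under RR sim
    double busy_time(const std::vector<SimJob>& jobs, int ninstances) {
        double sum = 0;
        for (const SimJob& j : jobs) {
            if (j.misses_deadline) sum += j.duration * j.ninstances;
        }
        return sum / ninstances;
    }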
svn path=/trunk/boinc/; revision=18629
(passed to server for crude deadline check) is computed.
Old: estimated delay is the interval for which the resource
is fully used (i.e., all instances busy).
Problem: this may cause unnecessary project starvation.
Example: a 1-CPU machine has a month-long CPDN job
with a 1-year deadline (it's not in deadline trouble).
Then the CPU estimated delay will be 1 month,
and the client won't get any work from projects
with deadlines shorter than 1 month.
New: estimated delay is the latest time at which the
resource is fully used and is being used by at least 1 job
that is projected to miss its deadline under RR.
Note: this isn't precise, but I don't think we can improve it
much without getting a lot more complex.
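A minimal sketch of the new rule, as it might look over the intervals produced by RR sim (illustrative types; the real code computes this while simulating):

    #include <algorithm>
    #include <vector>

    struct SimInterval {
        double end_time;             // when this simulated interval ends
        bool fully_used;             // all instances of the resource busy
        bool has_deadline_miss_job;  // some running job is projected to miss its deadline
    };

    double estimated_delay(const std::vector<SimInterval>& intervals) {
        double latest = 0;
        for (const SimInterval& iv : intervals) {
            if (iv.fully_used && iv.has_deadline_miss_job) {
                latest = std::max(latest, iv.end_time);
            }
        }
        return latest;
    }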
svn path=/trunk/boinc/; revision=18607
- client: show times correctly in rr_sim debug msgs
- client: in "requesting new tasks" msg,
say what resources we're requesting (if there's more than CPU)
- client: estimated delay was possibly being calculated incorrectly
because of roundoff error
svn path=/trunk/boinc/; revision=18269
- first schedule jobs projected to miss deadline in EDF order
- then schedule remaining jobs in FIFO order
This is intended to reduce the number of preemptions of coproc jobs,
and hence (since they are always preempted by being told to quit)
to reduce the wasted time due to checkpoint gaps;
see the sketch after this list.
- client: the CPU scheduling policy made use of the number
of deadline misses in various places.
This should include only the deadline misses of CPU jobs.
So move "deadlines_missed" from RR_SIM_STATUS and PROJECT
to RSC_PROJECT_WORK_FETCH so that we have separate counts
for CPU and coproc jobs, and use the count for CPU jobs.
- GUI RPC: removed the rr_sim_deadlines_missed field
from project descriptor.
This is no longer meaningful, and it didn't seem to be used anywhere.
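A minimal sketch of the two-phase ordering from the first item above (illustrative types): deadline-miss jobs first in EDF order, then the rest in FIFO order.

    #include <algorithm>
    #include <vector>

    struct RunnableJob {
        double deadline;
        int arrival_order;
        bool misses_deadline_in_rr_sim;
    };

    void order_jobs(std::vector<RunnableJob>& jobs) {
        std::stable_sort(jobs.begin(), jobs.end(),
            [](const RunnableJob& a, const RunnableJob& b) {
                if (a.misses_deadline_in_rr_sim != b.misses_deadline_in_rr_sim)
                    return a.misses_deadline_in_rr_sim;      // deadline-miss jobs first
                if (a.misses_deadline_in_rr_sim)
                    return a.deadline < b.deadline;          // EDF among them
                return a.arrival_order < b.arrival_order;    // FIFO for the rest
            });
    }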
svn path=/trunk/boinc/; revision=17785
i.e., list both win/x86 and win/x86 + NVIDIA.
This will allow the manager to show which projects can
use the host's coprocessors,
and also to gray out projects that require an absent coproc.
- fix compile warnings
svn path=/trunk/boinc/; revision=17735
set the job params to reasonable values (see below),
and make it easy to change these values in the script
- create_work (function and script): change default job params:
FLOPs est: 1e9 => 3600e9
FLOPs bound: 1e10 => 86400e9
mem bound: 100MB => 500MB
disk bound: 100MB => 1GB
delay bound: 100000s => 1 week
svn path=/trunk/boinc/; revision=17703