boinc

Commit Graph

Author	SHA1	Message	Date
David Anderson	59328aaccb	- client: change how short term debt is updated. Old: it's based entirely on CPU time. So a GPU project, whose app uses only a fraction of a CPU, accrues positive debt. This is OK if the project has only GPU apps, since STD is not (currently) used for GPU scheduling. But some projects have both CPU and GPU apps. New: STD is based on total processing. It has terms for each resource type. The notion of "runnable resource share" is specific to a type. Note: the notion of "resource share fraction" appears in a couple of other places: - it's passed to apps in app_init_data.xml - it's passed in scheduler requests. It should be broken down by resource type in these cases too. Note to self: do this later. svn path=/trunk/boinc/; revision=19762	2009-12-02 03:41:52 +00:00
David Anderson	6fc27ffc44	- client: use [wfd] consistently svn path=/trunk/boinc/; revision=19725	2009-11-27 21:21:39 +00:00
David Anderson	609872edf7	- client: the checkin of 15 Oct related to multi-thread apps didn't work due to a typo. - client: if <ncpus> is present in cc_config.xml, we're supposed to act as if there were that many physical CPUs. In particular, we need to set host_info.p_ncpus to that value, since that's what is reported in scheduler requests. svn path=/trunk/boinc/; revision=19522	2009-11-09 19:51:31 +00:00
David Anderson	b29f920999	- Move URL-related code to a new file - Remove stuff related to SOCKS version, since we only support 5 svn path=/trunk/boinc/; revision=19480	2009-11-05 18:02:51 +00:00
David Anderson	9f93535428	- client: bug fixes to the above. Don't fetch work for an unable resource. svn path=/trunk/boinc/; revision=19302	2009-10-14 19:11:11 +00:00
David Anderson	94cbb0d2dc	- client: make the order of the result vector consistent with the order used to select coproc jobs svn path=/trunk/boinc/; revision=19238	2009-10-03 03:07:03 +00:00
David Anderson	833f417ae5	- client: better behavior if a GPU goes away: 1) if an APP_VERSION is missing a coprocessor, don't delete it and its files. (If the coprocessor returns, we won't need to re-download) 2) if a RESULT uses an app version that is missing a coprocessor, abort it (rather than deleting it). The client will report the result on the next scheduler RPC, and the server will make a new instance. svn path=/trunk/boinc/; revision=19235	2009-10-02 23:39:38 +00:00
David Anderson	41e3b06b23	- client and scheduler RPC: add optional <cpu_backoff>, <cuda_backoff>, and <ati_backoff> elements to scheduler reply. These specify backoffs for the resource types, overriding the existing backoff mechanism. Projects can supply these if they don't have apps of a particular type and don't want to get periodic requests for them. svn path=/trunk/boinc/; revision=19059	2009-09-16 17:34:19 +00:00
David Anderson	0d4632d65d	- client: save space in req msg. Didn't make much difference. svn path=/trunk/boinc/; revision=19037	2009-09-10 18:32:46 +00:00
David Anderson	4c52989f59	- client: improve the estimation of "busy time" (see 17 July checkin). If you have 2 CPUs and a 1-day job in EDF mode, the busy time should be zero, not .5 days. Add a class BUSY_TIME_ESTIMATOR that makes a somewhat better (though still fairly crude) estimate. svn path=/trunk/boinc/; revision=19003	2009-09-03 20:31:04 +00:00
David Anderson	112cec62a5	- client: fix to [18945]; we only want to max the overall request with a GPU request if project is anonymous platform AND it has an app for that GPU type - client: report overall work request as well as per-resource-type requests svn path=/trunk/boinc/; revision=18994	2009-09-02 21:36:25 +00:00
David Anderson	29c1751898	- client: if project is anonymous platform, set the overall work req to the max of the requests for different resource types. Otherwise projects with old schedulers won't send us work. svn path=/trunk/boinc/; revision=18945	2009-08-31 03:42:01 +00:00
David Anderson	073e6ded2c	- client and scheduler: lay the groundwork for "fractional coproc jobs", e.g. the Milkyway@home ATI app, of which we can typically run 2 or 3 instances at once on a GPU. Changes include: - In APP_VERSION, don't use a COPROCS to represent the GPU requirements; just use doubles ncudas and natis. - sufficient_coprocs() etc. are no longer members of COPROCS - in HOST_USAGE, ncudas and natis are doubles - in scheduler request, req_instances is now a double This checkin doesn't include the job scheduling logic, i.e. assigning jobs to GPUs. That will follow. svn path=/trunk/boinc/; revision=18868	2009-08-19 18:41:47 +00:00
David Anderson	c3fe504e1d	- client: add ATI support to job scheduling and work fetch svn path=/trunk/boinc/; revision=18850	2009-08-17 16:50:40 +00:00
David Anderson	5753153909	- client: 2nd try on my last checkin. We need to estimate 2 different delays for each resource type: 1) "saturated time": the time the resource will be fully utilized (new name for the old "estimated delay"). This is used to compute work requests. 2) "busy time": the time a new job would have to wait to start using this resource. This is passed to the scheduler and used for a crude deadline check. Note: this is ill-defined; a single number doesn't suffice. But as a very rough estimate, I'll use the sum of (J.duration * J.ninstances)/ninstances over all jobs that miss their deadline under RR sim. svn path=/trunk/boinc/; revision=18629	2009-07-17 18:29:10 +00:00
David Anderson	46d9e8f087	- client: record the time results are received. Process non-EDF GPU jobs in this order. svn path=/trunk/boinc/; revision=18531	2009-06-30 20:22:54 +00:00
David Anderson	10f9e11ee6	- lib: created a new file for declaring "replacements" for functions like strlcpy() etc. config.h is included here rather than in str_util.h svn path=/trunk/boinc/; revision=18437	2009-06-16 20:54:44 +00:00
David Anderson	ab7cf41267	- client: tweak messages svn path=/trunk/boinc/; revision=18280	2009-06-03 21:59:47 +00:00
David Anderson	c2097091fe	- client: show "est. delay" correctly in work fetch debug msgs - client: show times correctly in rr_sim debug msgs - client: in "requesting new tasks" msg, say what resources we're requesting (if there's more than CPU) - client: estimated delay was possibly being calculated incorrectly because of roundoff error svn path=/trunk/boinc/; revision=18269	2009-06-02 22:53:57 +00:00
David Anderson	cc62cce8f7	- client: if scheduler request didn't request work, don't report 0 tasks - scheduler: fix crash if anonymous platform svn path=/trunk/boinc/; revision=18259	2009-06-02 05:12:06 +00:00
David Anderson	44c02144e9	- lib: return proper error codes from boinc_rename() and boinc_mkdir() - client: Haiku support (from Urias McCullough) - client: include plan class in other_result list in sched request (for resource-specific jobs-in-progress limit) svn path=/trunk/boinc/; revision=18250	2009-05-31 16:38:37 +00:00
David Anderson	af93af28f7	- client: eliminate the need to write the state file on each checkpoint. Instead, write the info into a file in the slot directory, and check for these files on startup. This should reduce the overhead of state-file writing on machines with lots of cores. There will still be a flurry of writes each time a job finishes, but reducing that overhead would be a larger job. - client: make sure we write the state file after a failed RPC svn path=/trunk/boinc/; revision=17814	2009-04-15 06:22:53 +00:00
David Anderson	b3f07e1a0c	- client: show project name in "backoff ended" msg svn path=/trunk/boinc/; revision=17719	2009-04-01 23:22:17 +00:00
David Anderson	3e04801942	- client: clear resource backoffs on user-requested RPC - client: randomize resource backoffs to avoid lockstep svn path=/trunk/boinc/; revision=17664	2009-03-26 16:56:20 +00:00
David Anderson	5a5b386313	- client: garbage collect after scheduler RPC; if project sent some irrelevant FILE_INFOs, this will avoid starting transfers for them. svn path=/trunk/boinc/; revision=17644	2009-03-23 01:33:17 +00:00
David Anderson	dfc62d896d	- Manager: show elapsed time instead of CPU time in Task tab. CPU time is visible in task Properties. - Manager: in task Properties, show final CPU and elapsed times if job is finished - client: honor backoff for account-manager-requested scheduler RPCs - client: keep track final elapsed time for results - GUI RPC: report final elapsed time svn path=/trunk/boinc/; revision=17588	2009-03-11 22:01:38 +00:00
David Anderson	47c889f002	- client: backoff overrides project-requested scheduler RPC. Otherwise, if scheduler is down, we'll retry infinitely every 10 secs - client: remove auto_update.poll() (not used) svn path=/trunk/boinc/; revision=17585	2009-03-10 22:14:16 +00:00
David Anderson	e74f93c10d	- client: if using anonymous platform, ignore (and complain about) app versions in scheduler reply - client: when reporting anonymous platform apps in sched request, don't include <file_info>s (not relevant to server) svn path=/trunk/boinc/; revision=17507	2009-03-05 17:45:36 +00:00
David Anderson	c750daed46	- client: reorganize and improve the logic for deciding when to do a scheduler RPC: if user request or acct mgr request, ignore backoff and suspend via GUI; in all other cases honor both of these. svn path=/trunk/boinc/; revision=17503	2009-03-04 22:55:57 +00:00
David Anderson	c481086bc0	- client: show duration estimates for CPU and CUDA separately - web: reverse Reply and Delete buttons in private msg page fixes #858 svn path=/trunk/boinc/; revision=17500	2009-03-04 21:02:18 +00:00
David Anderson	fd5fc4a24b	- client: fix bug that could cause scheduler RPC to ask for work inappropriately, and tell user that it wasn't asking for work. Here's what was going on: There are two different structures with work request fields (req_secs, req_instances, estimated_delay): COPROC_CUDA coproc_cuda and RSC_WORK_FETCH cuda_work_fetch. WORK_FETCH::choose_project() copied from cuda_work_fetch to coproc_cuda, but only if a project was selected. WORK_FETCH::clear_request() clears cuda_work_fetch but not coproc_cuda. Scenario: - a scheduler op is made to project A requesting X>0 secs of CUDA - later, a scheduler op is made to project B for reason other than work fetch (e.g., user request) - choose_project() doesn't choose anything, so the value of coproc_cuda->req_secs remains X - clear_request() is called but that doesn't change coproc_cuda Solution: work-fetch code no longer knows about internals of COPROC_CUDA and is not responsible for settings its request fields. The copying of request fields from RSC_WORK_FETCH to COPROC is done at a higher level, in CLIENT_STATE::make_scheduler_request() Additional bug fix: estimated_delay wasn't being cleared in some cases. svn path=/trunk/boinc/; revision=17411	2009-02-27 18:46:00 +00:00
David Anderson	97b82d4685	- client: shuffle the startup code to avoid showing wrong prefs info on first-time startup. - client: don't do an RPC until we've done CPU benchmarks. We need the benchmark values to fill in app_version.flops svn path=/trunk/boinc/; revision=17404	2009-02-26 22:41:48 +00:00
David Anderson	41fe3e40bf	- client: tag messages with project where possible; fixes #852 - client: show fetch share rather than run share in wfd message svn path=/trunk/boinc/; revision=17398	2009-02-26 17:12:55 +00:00
David Anderson	31e7127776	- client: make timeout values into #defines svn path=/trunk/boinc/; revision=17396	2009-02-26 03:24:39 +00:00
Eric J. Korpela	8f3abcc835	- Added checks for net/.h, arpa/.h, netinet/.h and code to figure out which of those files to include - Modified MAC address check to work on some non-Linux unixes. (mac_address.cpp) - Added suggested change to "already attached to project" checking. (ProjectInfoPage.cpp) - changed includes of standard c header files to their c++ equivalents (i.e. replaced <stdio.h> with <cstdio>) for namespace protection. - replaced "using namespace std;" with more explicit "using std::function" in several files. - Fixed bug in checking whether the os is OS/2 and added conditional OS_OS2 to the build environment. (boinc_platform.m4,configure.ac) - Changed build environment to not use -nostandardlibs unless we are using G++ and static linkage is specified. (configure.ac) - Added makefiles and package building files for solaris CSW package manager. - Fixed bug with attempting to find login name using logname. (configure.ac) - Added ifdef HAVE_ protection around some include files commonly found in sys. - Added support for unified binary for x86_64/i686-pc-solaris. (cs_platforms.cpp) - generate_host_cpid() now uses MAC address on non-linux unix. (hostinfo_network.cpp) - Macro BOINC_SET_COMPILE_FLAGS now doesn't check gcc only flags on non-gcc compilers. (boinc_set_compile_flags.m4) - Library compiles no longer depend upon the library extension or require the library to be prefixed with lib. - More fixes for fcgi builds. - Added declaration of "struct ether_addr" and ether_ntoa(). Have not yet implemented ether_ntoa() for machines that don't have it, or where it is buggy. (unix_util.h) - Added FCGI::perror() which calls FCGI_perror(). (boinc_fcgi.{h,cpp}) - Fixed library Makefiles so that all required headers get installed. svn path=/trunk/boinc/; revision=17388	2009-02-26 00:23:23 +00:00
David Anderson	2574afb41c	- client: more instances of showing project with message. Fixes #848 svn path=/trunk/boinc/; revision=17335	2009-02-23 04:54:04 +00:00
David Anderson	8973e39479	- client: don't complain that master URLs differ if it's only in case svn path=/trunk/boinc/; revision=17310	2009-02-19 21:34:48 +00:00
David Anderson	3e98909ab6	- client: adjust debts at least every minute. This fixes a bug that can cause debts to NEVER get updated. - client: added "abort_jobs_on_exit" feature (available by --abort_jobs_on_exit cmdline or <abort_jobs_on_exit> in cc_config.xml). If set, when the client is exited by user request (this includes signals on Unix) it marks all pending jobs as aborted, and does a scheduler RPC to all projects with jobs. When these are completed the client exits. This is useful when BOINC is being used on grids where it is wiped clean after each run. svn path=/trunk/boinc/; revision=17300	2009-02-18 19:47:02 +00:00
David Anderson	218af029f1	- client: show proxy info correctly on startup - client: fix minor bug that produced spurious adjust debt interval too long messages when zero projects svn path=/trunk/boinc/; revision=17197	2009-02-10 21:59:55 +00:00
David Anderson	872ed1a65c	- client: if master file doesn't have URLs, clear RPC request svn path=/trunk/boinc/; revision=17196	2009-02-10 19:33:46 +00:00
David Anderson	864864ab76	- client: all scheduler RPCs except user request are subject to backoff. - client: if a project-requested RPC doesn't return work, don't do resource backoff. - client: if a user-requested scheduler RPC errors out, clear the request svn path=/trunk/boinc/; revision=17191	2009-02-09 22:00:31 +00:00
David Anderson	258dac62b2	- client: it the state file or an RPC reply has an app version using a coprocessor we don't know about, ignore it (and all results using that app_version will be flushed). This deals with the situation where we have some GPU jobs, but the GPU card is removed (previously this resulted in a crash). This requires some code shuffling so that we check for coprocessors before reading state file. svn path=/trunk/boinc/; revision=17161	2009-02-06 00:22:21 +00:00
David Anderson	af86d4326f	- client: when accounting job elapsed time, ignore intervals longer than 10 secs; that could only happen if the client or host was suspended/hibernated. - client: in adjust_debts(), ignore intervals longer than 2work fetch period, not 2CPU sched period. adjust_debts() is called from work fetch. svn path=/trunk/boinc/; revision=17154	2009-02-05 20:16:28 +00:00
David Anderson	5eeb9c0815	- client: fix bug that caused infinite sched RPCs if project down svn path=/trunk/boinc/; revision=17127	2009-02-03 18:08:40 +00:00
David Anderson	b7a2c227ca	- Work fetch / scheduler: There are two mechanisms to prevent the scheduler from sending jobs that won't finish by their deadline. Simple mechanism: The client sends the interval x for which CPUs are projected to be saturated. Given a job with estimated duration y, the scheduler doesn't send it if x + y exceeds the delay bound. If it does send it, x is incremented by y. Complex mechanism: Client sends workload description. Scheduler does EDF simulation, sees if deadlines are missed. The only project using this AFAIK is BOINC alpha test. Neither of these mechanisms takes coprocessors into account, and as a result jobs could be sent that are doomed to miss their deadline. This checkin adds coprocessor awareness to the Simple mechanism. Changes: Client: compute estimated delay (i.e. time until non-saturation) for coprocessors as well as CPU. Send them in scheduler request as part of coproc descriptor. Scheduler: Keep track of estimated delays separately for different resources - client: fixed bug that computed CPU estimated delay incorrectly - client: the work request (req_secs) for a resource is the min of the project's share and the shortfall. svn path=/trunk/boinc/; revision=17086	2009-01-30 21:25:24 +00:00
David Anderson	604a83aa96	- client: if user requests RPC, do it even if project is backed off - manager: show backoff interval correctly svn path=/trunk/boinc/; revision=17070	2009-01-29 20:07:48 +00:00
David Anderson	8952fbe60e	- client: if we're making an RPC to a project because of user request, clear the resource backoff times so that we potentially can ask the project for work. svn path=/trunk/boinc/; revision=17052	2009-01-27 22:25:32 +00:00
David Anderson	132cc6bba3	- client: debugging CUDA-related stuff - client: if reset a project, clear its overall and per-resource backoffs svn path=/trunk/boinc/; revision=16862	2009-01-10 00:48:22 +00:00
David Anderson	2860574fa5	compile fixes and debug message fixes svn path=/trunk/boinc/; revision=16836	2009-01-08 00:20:04 +00:00
David Anderson	8740ffdc94	- client: more work-fetch stuff. No more per-project shortfall. It's getting pretty close. svn path=/trunk/boinc/; revision=16765	2009-01-03 06:01:17 +00:00
David Anderson	8c591e31df	- client: first whack at new work-fetch logic. Very preliminary. svn path=/trunk/boinc/; revision=16754	2008-12-31 23:07:59 +00:00
David Anderson	2dc7056ee0	- client: code shuffling - scheduler: fix typo in msg svn path=/trunk/boinc/; revision=16750	2008-12-30 19:01:25 +00:00
David Anderson	edf0ab1631	- client: app_info.xml's are parsed before p_fpops is known, so avp->fpops is zero. Fix this by filling in zero avp->fpops later on. svn path=/trunk/boinc/; revision=16633	2008-12-06 03:19:52 +00:00
David Anderson	84f1193a9d	- client: use FLOPs, rather than CPU time, as the basis for estimating job completion times. This should improve estimates for GPU apps, and prevent the DCF from getting messed up. svn path=/trunk/boinc/; revision=16598	2008-12-02 03:58:32 +00:00
David Anderson	07bd768e9d	- server: add -sleep_interval args to file_deleter and transitioner (from Nicolas; fixes #783) svn path=/trunk/boinc/; revision=16576	2008-11-26 19:09:27 +00:00
David Anderson	d9aef115bc	- client: fix crash when sched_op_debug is enabled svn path=/trunk/boinc/; revision=16326	2008-10-27 23:21:33 +00:00
David Anderson	e24e551bd8	- client: clarify and fix the semantics of "next RPC time". Here's are the new semantics: a scheduler reply can include <next_rpc_delay> Make another RPC ASAP after this amount of time elapses. This is specified by the <next_rpc_delay> element in config.xml. <request_delay> Don't make another RPC until this amount of time elapses. This is sent automatically (and sometimes with large delays) by various parts of the scheduler. next_rpc_delay now "overrides" request_delay in the sense that request_delay is ignored if it's greater than next_rpc_delay. In addition: the client maintains a min_rpc_time which is set based on request_delay and also by various exponential backoff schemes. new_rpc_delay now overrides this as well, in the same sense. svn path=/trunk/boinc/; revision=16206	2008-10-14 21:16:04 +00:00
David Anderson	7512f94f51	- client: show est CPC time of jobs returned by sched RPC, if sched_op_debug; fixes #256 svn path=/trunk/boinc/; revision=16149	2008-10-07 01:51:30 +00:00
David Anderson	98cfb8d3b0	- rename .C files to .cpp so that Doxygen will work svn path=/trunk/boinc/; revision=16069	2008-09-26 18:20:24 +00:00

1 2 3 4

159 Commits