- add DB field for storing job keywords: workunit.keywords
add this to various DB parse/write functions
- add --keywords option to create_work for specifying job keywords
- add <keyword_sched> option in config.xml for enabling keyword score
(it's disabled by default).
If set, increment score for "yes" keyword matches,
and disallow jobs with "no" matches
- in scheduler, add array job_keywords_array for parsed versions
of job keywords (vector<int>)
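A minimal sketch of the scoring rule, assuming the user's "yes" and "no"
keywords are available as integer sets (all names here are illustrative,
not BOINC's actual identifiers):

    #include <set>
    #include <vector>

    // Returns false if the job has any "no" keyword (job disallowed);
    // otherwise bumps the score once per "yes" keyword match.
    bool apply_keyword_score(
        const std::vector<int>& job_keywords,
        const std::set<int>& user_yes,
        const std::set<int>& user_no,
        double& score
    ) {
        for (size_t i = 0; i < job_keywords.size(); i++) {
            if (user_no.count(job_keywords[i])) return false;
            if (user_yes.count(job_keywords[i])) score += 1.0;
        }
        return true;
    }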
also:
- use symbols instead of numbers for slow_check() return values
- parse unused fields in req message to remove unparsed-XML warnings
This supports the TACC use case,
in which the jobs in a batch can use different Docker images
and different input and output file signatures,
none of which are known in advance.
Python API binding:
- A JOB_DESC object can optionally contain wu_template and result_template
elements, which are the templates (the actual XML) to use for that job.
Add these to the XML request message if present.
- Added the same capability to the PHP binding, but not C++.
- Added and debugged test cases for both languages.
Also, submit_batch() can take either a batch name (in which case
the batch is created) or a batch ID
(in which case the batch was created earlier, e.g. prior to remotely staging files).
RPC handler:
- in submit_batch(), check for jobs with templates specified
and store them in files.
For input templates (which are deleted after creating jobs)
we put them in /tmp,
and use a map so that identical templates share a single file.
For output templates (which have to last until all jobs are done)
we put them in templates/tmp, with content-based filenames
to economize.
- When creating jobs, or generating SQL strings for multiple jobs,
use these names as --wu_template_filename
and --result_template_filename args to create_work
(either cmdline args or stdin args)
- Delete WU templates when done
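The dedup idea, sketched in C++ for concreteness (the handler itself is
PHP; the hash-based naming and helper below are assumptions, not the
actual code):

    #include <fstream>
    #include <functional>
    #include <map>
    #include <sstream>
    #include <string>

    // contents -> filename; identical templates share one file
    static std::map<std::string, std::string> stored;

    std::string store_template(const std::string& contents, const std::string& dir) {
        std::map<std::string, std::string>::iterator it = stored.find(contents);
        if (it != stored.end()) return it->second;   // already written
        std::ostringstream path;                     // content-based name
        path << dir << "/templ_" << std::hash<std::string>()(contents);
        std::ofstream out(path.str().c_str());
        out << contents;
        stored[contents] = path.str();
        return path.str();
    }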
create_work.cpp:
handle per-job --wu_template and --result_template args in stdin job lines
(the names of per-job WU and result templates).
Maintain a map mapping WU template name to contents,
to avoid repeatedly reading them.
For jobs that don't specify templates, use the ones specified
at the batch level, or the defaults.
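A minimal sketch of that cache, keyed by template filename (helper and
names are mine, not the actual create_work.cpp code):

    #include <fstream>
    #include <map>
    #include <sstream>
    #include <string>

    static std::map<std::string, std::string> template_cache;

    // Return template contents, reading the file only on first use.
    int get_template(const std::string& path, std::string& contents) {
        std::map<std::string, std::string>::iterator it = template_cache.find(path);
        if (it != template_cache.end()) {
            contents = it->second;
            return 0;
        }
        std::ifstream in(path.c_str());
        if (!in) return -1;          // file not found / unreadable
        std::ostringstream buf;
        buf << in.rdbuf();
        contents = buf.str();
        template_cache[path] = contents;
        return 0;
    }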
The SETI@home result table is about to run out of 32-bit IDs,
so we need to move to 64-bit result IDs.
This will happen to the workunit table at some point too.
I changed the server C++ code to use the "long" type for all DB IDs
(and to use appropriate conversion codes like %lu).
"long" is 64 bit on 64-bit machines.
For uniformity I did this for all tables,
even ones (like app) that will never get big.
I chose NOT to change the DB schema for now.
The new code will work with 32-bit ID fields in the DB.
As projects approach the 32-bit limit on a table they can change
its ID field, and fields that reference this table, to BIGINT.
This is likely to happen only on the result and workunit tables.
I put functions in html/ops/db_update.php
to change the IDs of these tables.
We were creating the workunit, then updating its transitioner_flags field.
If the transitioner ran in between,
it would (incorrectly) create results for the workunit.
Solution: set transitioner_flags during insert.
A "remote input file" is located on a data server other than the project server.
Previously these could be specified only in the input template,
which was of limited utility.
Add new ways of specifying remote input files:
1) in the create_work program, a remote input file can be specified
with command-line args
--remote_file name URL nbytes MD5
or by the same syntax in stdin when creating multiple jobs
2) add a variant of create_work() called create_work2(),
which takes a vector of INFILE_DESC structures that can specify
either local or remote files
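A sketch of what such a descriptor might look like (field names are
guesses based on the --remote_file syntax above, not the actual struct):

    #include <string>
    #include <vector>

    struct INFILE_DESC {
        bool is_remote;
        // local files: logical name resolved on the project server
        std::string name;
        // remote files: where the client can fetch the file, plus
        // the size and MD5 needed to verify it
        std::string url;
        double nbytes;
        std::string md5;
    };

    // create_work2() takes a vector of these instead of local names only:
    // int create_work2(..., const std::vector<INFILE_DESC>& infiles);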
The job submission RPC handler (PHP) originally ran the
create_work program once per job.
This took about 1.5 minutes to create 1000 jobs.
Recently I changed this so that create_work is run only once;
it does one SQL insert per job.
Disappointingly, this was only slightly faster: 1 min per 1000 jobs.
This commit changes create_work to create multiple jobs per SQL insert
(as many as will fit in a 1 MB query, which is the default limit).
This speeds things up by a factor of 100: 1000 jobs in 0.5 sec.
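A sketch of the batching logic, assuming a 1 MB cap and pre-rendered
per-job VALUES tuples (all names are illustrative):

    #include <string>
    #include <vector>

    const size_t MAX_QUERY = 1000000;   // MySQL's default query size limit

    // Group per-job VALUES clauses into as few INSERTs as possible.
    std::vector<std::string> build_inserts(const std::vector<std::string>& values) {
        std::vector<std::string> queries;
        std::string q;
        for (size_t i = 0; i < values.size(); i++) {
            if (!q.empty() && q.size() + values[i].size() + 1 > MAX_QUERY) {
                queries.push_back(q);   // flush before exceeding the cap
                q.clear();
            }
            if (q.empty()) {
                q = "insert into workunit (name, ...) values " + values[i];
            } else {
                q += "," + values[i];
            }
        }
        if (!q.empty()) queries.push_back(q);
        return queries;
    }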
Previously if you wanted to create lots of jobs from a script (e.g. PHP)
you had to run create_work once per job.
With the --stdin option you run it once,
passing it a file (via stdin) with one line per job.
Each line can specify a command line and/or a set of input files.
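For example, a line might look like this (a hypothetical job; the
per-line syntax mirrors create_work's command-line options):

    --wu_name job_42 --command_line "--fft 512" infile_1 infile_2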
On my server this gives a performance of about 1000 jobs per minute,
which is less than I would have expected,
but all the time is spent in doing MySQL inserts
so that's as good as we can do for now.
Also fix a bug in stage_file.
See http://boinc.berkeley.edu/trac/wiki/MultiSize
The components of this include:
- DB changes:
add size_class to workunit and result
n_size_classes to app; >1 means multi-size
- size_regulator daemon program: change result states
from INACTIVE to UNSENT carefully
- size_census program: writes quantile info to flat files
- transitioner: when creating results for multi-size apps,
set server state to INACTIVE
- sched shmem (feeder): read quantile info from flat files,
store in shared memory
- scheduler (score-based scheduling): for multi-size apps,
add a component to the score function for size class
(see the sketch after this list).
- show_shmem: show result size class
- make_work (and other callers of count_unsent_results()):
count both INACTIVE and UNSENT
- create_work: add --size_class cmdline option
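A sketch of that extra score term (the weighting and names are
assumptions, not the actual scheduler code; see the wiki page above):

    // Favor jobs whose size class matches the host's; penalize
    // mismatches in proportion to how far apart the classes are.
    double size_class_score_term(int host_size_class, int job_size_class) {
        int diff = host_size_class - job_size_class;
        if (diff < 0) diff = -diff;
        return diff == 0 ? 1.0 : -1.0 * diff;
    }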
Also:
- if we get MySQL errors during upgrade, don't rewrite db_version
- Fix various #include issues.
CODING STYLE LAW (minimal inclusion principle):
If foo.cpp requires <blah.h>,
#include <blah.h> in foo.cpp, NOT foo.h
svn path=/trunk/boinc/; revision=25837
Fix a bug that allowed targeted jobs
to be sent to non-targeted hosts:
the feeder was erroneously putting targeted jobs
in the shared mem cache.
Changes:
- The feeder only enumerates jobs for which
workunit.transitioner_flags is zero.
NOTE: this field is nonzero iff the job is assigned.
- create_work: when creating an assigned job,
set workunit.transitioner_flags appropriately
svn path=/trunk/boinc/; revision=25314
This now supports two main use cases:
1) there's a job that you want to run once on all hosts,
present and future
(or all hosts belonging to a user, or to a team).
The job is never transitioned, validated, or assimilated.
2) there's a normal job for which you want to use only
hosts belonging to a specific user (e.g. cluster or cloud hosts).
This restriction can be made either when the job is created,
or on the fly,
e.g. as part of a scheme for accelerating batch completion.
For the latter purpose we now provide a function
restrict_wu_to_user(DB_WORKUNIT&, int userid)
(usage sketched below).
The job goes through the standard
transitioner/validator/assimilator path.
These cases are enabled by config flags
<enable_assignment_multi/>
<enable_assignment/>
respectively.
Assignments of type 2) are no longer stored in shared mem,
so there is no limit on their number.
There is no longer a rule that assigned job names must contain "asgn".
NOTE: this requires a database update.
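A sketch of on-the-fly restriction using that function (DB_WORKUNIT is
stubbed here for illustration; the real type comes from the scheduler's
DB layer):

    struct DB_WORKUNIT { long id; int batch; };   // stub, not the real struct

    // provided by the sched library, per this note:
    int restrict_wu_to_user(DB_WORKUNIT& wu, int userid);

    // e.g. to accelerate batch completion, retarget a batch's
    // remaining unsent jobs at one user's cluster/cloud hosts
    int retarget_batch(DB_WORKUNIT* wus, int n, int userid) {
        for (int i = 0; i < n; i++) {
            int retval = restrict_wu_to_user(wus[i], userid);
            if (retval) return retval;
        }
        return 0;
    }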
svn path=/trunk/boinc/; revision=25169
Report it (along with disk usage) in scheduler request messages.
This will allow the scheduler to send file-delete commands
if the project is using more than its share.
- client: add <disk_usage_debug> log flag
- create_work: add --help, show --command_line option
svn path=/trunk/boinc/; revision=24968
- web: add interfaces for submitting and controlling batches of jobs
- web: add an administrative interface for controlling
user permissions for submitting jobs
- web: add an interface where users can view and control
their submitted jobs
See: http://boinc.berkeley.edu/trac/wiki/RemoteJobs
This is at a functional but rough stage.
svn path=/trunk/boinc/; revision=23762
A bug introduced in [21181] produced a messed-up query that assigned garbage values to:
host_app_version.turnaround_var
host_app_version.turnaround_q
host_app_version.max_jobs_per_day
host_app_version.consecutive_valid
To repair these:
- set turnaround_var and turnaround_q to zero
- if max_jobs_per_day is outside of
(0..config.daily_result_quota)
set it to config.daily_result_quota
- if consecutive_valid is outside (0..1000), set it to zero
I added a script, html/ops/repair_21812.php, that does this;
if you ran server code between [21181] and [21812], run this script.
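Restated as code, the repair rules look like this (a C++ paraphrase of
what the PHP script does; the struct and names are illustrative):

    struct HOST_APP_VERSION_FIX {
        double turnaround_var, turnaround_q;
        double max_jobs_per_day;
        int consecutive_valid;
    };

    // Apply the repair rules to one host_app_version row.
    void repair(HOST_APP_VERSION_FIX& hav, double daily_result_quota) {
        hav.turnaround_var = 0;
        hav.turnaround_q = 0;
        if (hav.max_jobs_per_day < 0 || hav.max_jobs_per_day > daily_result_quota) {
            hav.max_jobs_per_day = daily_result_quota;
        }
        if (hav.consecutive_valid < 0 || hav.consecutive_valid > 1000) {
            hav.consecutive_valid = 0;
        }
    }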
- scheduler/transitioner: add <debug_quota> log flag
- changed the build system to always use -Wall
(if we'd done this before, this bug wouldn't have happened)
- fixed a bunch of other compile warnings
svn path=/trunk/boinc/; revision=21812
- client (Unix): if the client crashes while benchmark processes are running,
make sure they detect this and exit.
- back-end programs: remove hardwired assumptions about
what directory they run in, and hence where config.xml is.
E.g., daemons looked for it in ".."; other programs expected it in the current dir.
New approach: all the programs look for the project dir as follows:
1) the environment var BOINC_PROJECT_DIR, if defined
2) the current dir, if config.xml is there.
3) else ".."
This means you can run programs in either proj/bin/ or proj/,
or (using BOINC_PROJECT_DIR) you can keep executables
outside of the project dir.
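The lookup order, sketched (helper names are mine, not BOINC's):

    #include <cstdlib>
    #include <string>
    #include <sys/stat.h>

    static bool file_present(const char* path) {
        struct stat s;
        return stat(path, &s) == 0;
    }

    // 1) $BOINC_PROJECT_DIR; 2) ".", if config.xml is here; 3) ".."
    std::string project_dir() {
        const char* p = getenv("BOINC_PROJECT_DIR");
        if (p) return p;
        if (file_present("config.xml")) return ".";
        return "..";
    }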
svn path=/trunk/boinc/; revision=18042
set the job params to reasonable values (see below),
and make it easy to change these values in the script
- create_work (function and script): change default job params:
FLOPs est: 1e9 => 3600e9
FLOPs bound: 1e10 => 86400e9
mem bound: 100MB => 500MB
disk bound: 100MB => 1GB
delay bound: 100000s => 1 week
svn path=/trunk/boinc/; revision=17703