boinc

Page: Job replication


Table of Contents

No replication
Replication

Eliminate discrepancies
Fuzzy comparison
Homogeneous replication

Adaptive replication

The results of a job cannot be trusted, because:

Some hosts have consistent or sporadic hardware problems, typically causing errors in floating-point computation.
Some volunteers may maliciously return wrong results; they may even reverse-engineer your application, deciphering and defeating any internal validation mechanism it might contain.

BOINC offers several mechanisms for validating results. However, there is no "one size fits all" solution. The choice depends on your requirements, and on the nature of your applications (you can use different mechanisms for different applications).

No replication

The first option is to not use replication. Each job gets done once. The validator examines single results.

This approach is useful if you have some way (application-specific) of detecting wrong results with high probability.
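
As an illustration of an application-specific check, here is a minimal sketch for a hypothetical job that asks a host to factor a known composite integer. The job type and the function name `is_plausible_factorization` are assumptions made for this example and are not part of BOINC. The point is that a single result can be accepted without replication when verifying it is much cheaper than computing it.

```python
from math import prod


def is_plausible_factorization(n: int, factors: list[int]) -> bool:
    """Accept a single result only if the reported factors are nontrivial
    and multiply back to n. This does not check that the factors are prime."""
    if not factors:
        return False
    if any(f <= 1 or f >= n for f in factors):
        return False
    return prod(factors) == n
```

A result such as `[7, 13]` for the input `91` passes this check, while `[7, 12]` or an empty list does not.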

Replication

BOINC supports replication: each job gets done on N different hosts, and a result is considered valid if a strict majority of hosts return it.
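
The strict-majority rule can be sketched as follows. This is an illustration only, not BOINC's validator code; the function name `majority_result` is hypothetical, and it assumes results can be compared for exact equality (for example, as strings or digests).

```python
from collections import Counter
from typing import Hashable, Optional, Sequence


def majority_result(results: Sequence[Hashable], n_replicas: int) -> Optional[Hashable]:
    """Return the result reported by a strict majority of the n_replicas
    hosts, or None if no result has reached a majority yet."""
    if not results:
        return None
    value, count = Counter(results).most_common(1)[0]
    # Strict majority: more than half of all replicas, not just of those returned.
    return value if count > n_replicas / 2 else None
```

With N = 3, two matching results are enough to decide; with N = 2, both hosts must agree.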

One problem with replication is that different computers can produce slightly different results for the same floating-point computation. This makes it hard to determine when two results "agree"; two different results may be equally correct.

There are several different ways of dealing with this problem.

Eliminate discrepancies

It may be possible to eliminate numerical discrepancies. To do so you'll need to select an appropriate compiler, compiler options, and math libraries, and to make sure that your checkpoint files are full precision.

This lets you do bitwise comparison of results. However, it is difficult and generally reduces the performance of your application.
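
A bitwise comparison can be sketched as a comparison of digests of the raw output bytes. The helper names `output_digest` and `bitwise_equal` are hypothetical, and the choice of SHA-256 is an assumption made for this example; comparing the bytes directly would work equally well.

```python
import hashlib
import struct


def output_digest(data: bytes) -> str:
    """Fingerprint of an output file's raw bytes."""
    return hashlib.sha256(data).hexdigest()


def bitwise_equal(a: bytes, b: bytes) -> bool:
    """Two outputs agree only if every bit is identical."""
    return output_digest(a) == output_digest(b)


# Two mathematically equal quantities that differ in the last bit of a double.
x = struct.pack("<d", 0.1 + 0.2)
y = struct.pack("<d", 0.3)
print(bitwise_equal(x, y))  # prints False
```

The last lines show why this only works once discrepancies are eliminated: a difference in the final bit of one value is enough to make two results disagree.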

Fuzzy comparison

If your application is numerically stable (i.e., small discrepancies lead to small differences in the result) you can write a "fuzzy comparison function" for the validator that considers two results as equivalent if they agree within some tolerance.
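
A fuzzy comparison function might look like the sketch below. It assumes each result has already been parsed into a list of floating-point values; the function name `results_agree` and the default tolerances are illustrative and should be chosen for your application.

```python
import math
from typing import Sequence


def results_agree(
    a: Sequence[float],
    b: Sequence[float],
    rel_tol: float = 1e-6,
    abs_tol: float = 1e-9,
) -> bool:
    """Treat two results as equivalent if they have the same length and
    every pair of corresponding values agrees within the tolerance."""
    if len(a) != len(b):
        return False
    return all(
        math.isclose(x, y, rel_tol=rel_tol, abs_tol=abs_tol)
        for x, y in zip(a, b)
    )
```

Note that agreement within a tolerance is not transitive: A may agree with B and B with C while A and C differ by more than the tolerance.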

Homogeneous replication

With this variant of replication, once an instance of a job has been sent to a host, additional instances are sent only to hosts that are "numerically equivalent" to it (i.e., that will return bit-identical results).

The notion of "numerical equivalence" depends on your application and how it was compiled. BOINC supplies two pre-defined equivalence relations, "coarse" and "fine". Use either of these ("coarse" is preferable, if it's fine enough for your app) or define your own if needed.
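
The dispatch rule can be sketched as follows. The `Host` fields and the two class functions here are assumptions made for illustration; they are not BOINC's actual definitions of "coarse" and "fine". The sketch only shows the mechanism: the first host fixes the job's equivalence class, and later instances go only to hosts in that class.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass(frozen=True)
class Host:
    os_name: str
    cpu_vendor: str
    cpu_model: str


def coarse_class(host: Host) -> Tuple[str, ...]:
    # Hypothetical coarse relation: same OS and CPU vendor.
    return (host.os_name, host.cpu_vendor)


def fine_class(host: Host) -> Tuple[str, ...]:
    # Hypothetical fine relation: additionally the same CPU model.
    return (host.os_name, host.cpu_vendor, host.cpu_model)


def can_send(
    job_class: Optional[Tuple[str, ...]],
    host: Host,
    class_of: Callable[[Host], Tuple[str, ...]] = coarse_class,
) -> bool:
    """A job with no class yet can go to any host; afterwards it can go
    only to hosts in the class fixed by the first instance."""
    return job_class is None or class_of(host) == job_class
```

A finer relation makes bit-identical results more likely but leaves fewer eligible hosts per job, which is why the coarser relation is preferable when it suffices.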

Adaptive replication

This is a refinement of the replication policy. It randomly decides whether to replicate jobs, based on the measured error rate of hosts. If the first instance of a job is sent to a host with a low error rate, then with high probability no further instances will be sent.
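
The random decision can be sketched as below. This is not BOINC's actual policy: the function name `should_replicate`, the threshold `max_error_rate`, the floor `min_probability`, and the linear mapping from error rate to probability are all assumptions made for this example.

```python
import random


def should_replicate(
    host_error_rate: float,
    rng: random.Random,
    max_error_rate: float = 0.05,
    min_probability: float = 0.01,
) -> bool:
    """Decide whether a job whose first instance went to this host
    should be sent to further hosts."""
    # Hosts with a high measured error rate are always replicated.
    if host_error_rate >= max_error_rate:
        return True
    # Otherwise replicate with a probability that shrinks with the error
    # rate but never reaches zero, so reliable hosts are still spot-checked.
    p = max(min_probability, host_error_rate / max_error_rate)
    return rng.random() < p
```

Keeping the probability above zero matters: if trusted hosts were never checked, a host could build up a good record and then return wrong results undetected.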

Adaptive replication is independent of the comparison policy; you can use it with bitwise comparison, fuzzy comparison, or homogeneous replication.
