A Runtime Heuristic to Selectively Replicate Tasks for Application-Specific Reliability Targets

dc.contributor.author Subasi, Omer
dc.contributor.author Yalcin, Gulay
dc.contributor.author Zyulkyarov, Ferad
dc.contributor.author Unsal, Osman
dc.contributor.author Labarta, Jesus
dc.date.accessioned 2025-09-25T10:39:29Z
dc.date.available 2025-09-25T10:39:29Z
dc.date.issued 2016
dc.description Subasi, Omer/0000-0002-5373-7570; en_US
dc.description.abstract In this paper we propose a runtime-based selective task replication technique for task-parallel high performance computing applications. Our selective task replication technique is automatic and does not require modification/recompilation of OS, compiler or application code. Our heuristic, we call App_FIT, selects tasks to replicate such that the specified reliability target for an application is achieved. In our experimental evaluation, we show that App_FIT selective replication heuristic is low-overhead and highly scalable. In addition, results indicate that complete task replication is overkill for achieving reliability targets. We show that with App_FIT, we can tolerate pessimistic exascale error rates with only 53% of the tasks being replicated. en_US
dc.identifier.doi 10.1109/CLUSTER.2016.54
dc.identifier.isbn 9781509036530
dc.identifier.issn 1552-5244
dc.identifier.scopus 2-s2.0-85013177229
dc.identifier.uri https://doi.org/10.1109/CLUSTER.2016.54
dc.identifier.uri https://hdl.handle.net/20.500.12573/3148
dc.language.iso en en_US
dc.publisher IEEE en_US
dc.relation.ispartof IEEE International Conference on Cluster Computing (CLUSTER) -- SEP 13-15, 2016 -- Taipei, TAIWAN en_US
dc.relation.ispartofseries IEEE International Conference on Cluster Computing
dc.rights info:eu-repo/semantics/openAccess en_US
dc.title A Runtime Heuristic to Selectively Replicate Tasks for Application-Specific Reliability Targets en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.id Subasi, Omer/0000-0002-5373-7570
gdc.author.scopusid 57144377900
gdc.author.scopusid 23029394200
gdc.author.scopusid 6505657882
gdc.author.scopusid 35612224700
gdc.author.scopusid 56256013400
gdc.author.wosid Unsal, Osman/B-9161-2016
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access open access
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.department Abdullah Gül University en_US
gdc.description.departmenttemp [Subasi, Omer; Zyulkyarov, Ferad; Unsal, Osman; Labarta, Jesus] Barcelona Supercomp Ctr, Barcelona, Spain; [Subasi, Omer; Labarta, Jesus] Univ Politecn Cataluna, E-08028 Barcelona, Spain; [Yalcin, Gulay] Abdullah Gul Univ, Kayseri, Turkey en_US
gdc.description.endpage 505 en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q3
gdc.description.startpage 498 en_US
gdc.description.woscitationindex Conference Proceedings Citation Index - Science
gdc.description.wosquality N/A
gdc.identifier.openalex W2560662349
gdc.identifier.wos WOS:000391414100078
gdc.index.type WoS
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.downloads 72
gdc.oaire.impulse 4.0
gdc.oaire.influence 2.9609917E-9
gdc.oaire.isgreen true
gdc.oaire.keywords Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
gdc.oaire.keywords Parallel processing (Electronic computers)
gdc.oaire.keywords Selective replication
gdc.oaire.keywords Processament en paral·lel (Ordinadors)
gdc.oaire.keywords Task parallelism
gdc.oaire.keywords Dataflow programming
gdc.oaire.keywords HPC and exascale computing
gdc.oaire.keywords :Informàtica::Arquitectura de computadors [Àrees temàtiques de la UPC]
gdc.oaire.popularity 1.8634359E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 0202 electrical engineering, electronic engineering, information engineering
gdc.oaire.sciencefields 02 engineering and technology
gdc.oaire.views 43
gdc.openalex.collaboration International
gdc.openalex.fwci 1.79121105
gdc.openalex.normalizedpercentile 0.88
gdc.opencitations.count 6
gdc.plumx.crossrefcites 3
gdc.plumx.mendeley 6
gdc.plumx.scopuscites 7
gdc.scopus.citedcount 7
gdc.virtual.author Yalçın Alkan, Gülay
gdc.wos.citedcount 6
relation.isAuthorOfPublication e0dc9e40-f936-402f-96c6-f4e668a0b9d3
relation.isAuthorOfPublication.latestForDiscovery e0dc9e40-f936-402f-96c6-f4e668a0b9d3
relation.isOrgUnitOfPublication 665d3039-05f8-4a25-9a3c-b9550bffecef
relation.isOrgUnitOfPublication 52f507ab-f278-4a1f-824c-44da2a86bd51
relation.isOrgUnitOfPublication ef13a800-4c99-4124-81e0-3e25b33c0c2b
relation.isOrgUnitOfPublication.latestForDiscovery 665d3039-05f8-4a25-9a3c-b9550bffecef

Files