Statistical properties of a class of randomized binary search algorithms

Bibliographic Details
Published in: Performance Evaluation, Vol. 168, p. 102478
Main Author: Xia, Ye
Format: Journal Article
Language: English
Published: Elsevier B.V., 1 June 2025
Subjects:
ISSN: 0166-5316
Online Access: Full text
Abstract In this paper, we analyze the statistical properties of a randomized binary search algorithm and its variants. These algorithms have applications in caching and load balancing in distributed environments such as peer-to-peer networks, cloud storage, data centers, and content distribution networks. The basic discrete version of the problem is as follows. Suppose there are m servers, numbered 1, 2, …, m, out of which the first k servers are marked as special, where k is unknown. These k servers may contain a particular file or service that clients want. The objective is to select one of the marked servers uniformly at random. Considering the intended applications, we impose the constraint that there is no central controller to facilitate the selection process. We start with a basic algorithm: In each step, the client requesting the service chooses a number y uniformly at random from 1,2,…,x, where x is the number chosen in the previous step, initially set to m in the first step. A query is then sent to server y asking whether y is marked. If the answer is yes, the algorithm returns y; otherwise, the process is repeated with x←y. In this paper, we primarily consider two batch versions of this algorithm in which multiple numbers are chosen in each step and multiple queries are made in parallel. We derive the mean and variance (exact and/or asymptotic) for the number of search steps in each version of the algorithm, and when possible, we give its distribution. Additionally, we analyze the access pattern of queries across the entire search space.
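The basic algorithm described in the abstract can be sketched as a short simulation. This is a minimal illustration under stated assumptions, not the author's implementation: the function name `randomized_search`, its parameters, and the use of a local comparison `y <= k` in place of a network query to server y are all hypothetical. The batch versions studied in the paper are not specified in the abstract and are not modeled here.

```python
import random


def randomized_search(m, k, rng=random):
    """Simulate the basic algorithm from the abstract (illustrative sketch).

    m   -- total number of servers, numbered 1..m
    k   -- number of marked servers (servers 1..k); unknown to a real client,
           used here only to answer the simulated query
    rng -- source of randomness exposing randint(a, b), inclusive on both ends

    Returns (y, steps): the selected marked server and the number of steps.
    """
    if not 1 <= k <= m:
        raise ValueError("need 1 <= k <= m")
    x = m          # upper bound of the current range, initially m
    steps = 0
    while True:
        y = rng.randint(1, x)   # choose y uniformly at random from 1, 2, ..., x
        steps += 1
        if y <= k:              # stands in for asking server y whether it is marked
            return y, steps
        x = y                   # otherwise repeat with x <- y


if __name__ == "__main__":
    rng = random.Random(0)
    server, steps = randomized_search(1000, 10, rng)
    print(server, steps)
```

Each unsuccessful step shrinks the range to 1..y with y > k, so the marked servers always remain inside the range and the loop ends with probability 1. The step that succeeds draws uniformly from 1..x and lands in 1..k, which is why the returned server is uniform over the marked set.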
ArticleNumber 102478
Author Xia, Ye
Author_xml – sequence: 1
  givenname: Ye
  orcidid: 0000-0002-9461-3865
  surname: Xia
  fullname: Xia, Ye
  email: yx1@ufl.edu
  organization: Department of Computer and Information Science and Engineering, University of Florida, United States of America
BookMark eNp9j01LAzEQhnOoYFv9A57yB7bmYz_BixS1hYIH9Rxmk4nNst0sSSjor3frevY0w7y8w_OsyGLwAxJyx9mGM17ed5sRz7ARTBTTQeRVvSDLKSizQvLymqxi7BhjRSXZkuzeEiQXk9PQ0zH4EUNyGKm3FKjuIf6uAQbjT-4bDW3dAOGLRoSgjxT6Tx9cOp7iDbmy0Ee8_Ztr8vH89L7dZYfXl_328ZBpXvOUlWCKwla5tpJpxppSVBzqCaepMOcCAG0rm5YB17LVojaWi9pi28i8KiU3ck3E_FcHH2NAq8bgThOS4kxd_FWnLv7q4q9m_6n0MJdwIjs7DCpqh4NG4wLqpIx3_9V_AFjFaHI
Cites_doi 10.1109/BigData50022.2020.9378465
10.1016/j.peva.2014.07.003
10.1145/1842733.1842736
10.1016/j.peva.2020.102151
10.1145/2254756.2254766
10.1145/383059.383071
10.1145/2390021.2390028
10.1016/j.peva.2017.08.004
10.1016/j.peva.2020.102146
10.1016/j.peva.2024.102408
10.1145/258533.258660
ContentType Journal Article
Copyright 2025 Elsevier B.V.
Copyright_xml – notice: 2025 Elsevier B.V.
DBID AAYXX
CITATION
DOI 10.1016/j.peva.2025.102478
DatabaseName CrossRef
DatabaseTitle CrossRef
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
ExternalDocumentID 10_1016_j_peva_2025_102478
S0166531625000124
ID FETCH-LOGICAL-c181t-6ad55f74cf30c0096271a800097e412aaefb39b0a1c3bc28df128feb9347631d3
ISICitedReferencesCount 0
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001442551700001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 0166-5316
IngestDate Sat Nov 29 07:55:27 EST 2025
Sat Jun 21 16:54:47 EDT 2025
IsPeerReviewed true
IsScholarly true
Keywords Distributed search
Randomized binary search algorithm
Analysis of algorithms
Parallel and distributed algorithms
Load balancing
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c181t-6ad55f74cf30c0096271a800097e412aaefb39b0a1c3bc28df128feb9347631d3
ORCID 0000-0002-9461-3865
ParticipantIDs crossref_primary_10_1016_j_peva_2025_102478
elsevier_sciencedirect_doi_10_1016_j_peva_2025_102478
PublicationCentury 2000
PublicationDate June 2025
2025-06-00
PublicationDateYYYYMMDD 2025-06-01
PublicationDate_xml – month: 06
  year: 2025
  text: June 2025
PublicationDecade 2020
PublicationTitle Performance evaluation
PublicationYear 2025
Publisher Elsevier B.V
Publisher_xml – name: Elsevier B.V
SSID ssj0005730
Score 2.401243
SourceID crossref
elsevier
SourceType Index Database
Publisher
StartPage 102478
SubjectTerms Analysis of algorithms
Distributed search
Load balancing
Parallel and distributed algorithms
Randomized binary search algorithm
Title Statistical properties of a class of randomized binary search algorithms
URI https://dx.doi.org/10.1016/j.peva.2025.102478
Volume 168
WOSCitedRecordID wos001442551700001&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVESC
  databaseName: Elsevier SD Freedom Collection Journals 2021
  issn: 0166-5316
  databaseCode: AIEXJ
  dateStart: 19950101
  customDbUrl:
  isFulltext: true
  dateEnd: 99991231
  titleUrlDefault: https://www.sciencedirect.com
  omitProxy: false
  ssIdentifier: ssj0005730
  providerName: Elsevier
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3LT8IwGG8UPHjxbcRXevBmRtjWreuRGAwaQzigwdPSrZ1CdCOIhvjX-3XtGOIjcvCyNN32temv_fpr8z0QOhOSU4c7iSUDR1okkNRijMcWhUXuqXDeUZ4l4u6GdjpBv8-6JtvbS55OgKZpMJ2y0b9CDXUAtnKdXQLumVCogDKADk-AHZ5_Al7Rxzz6svKxUlftYxUztfCDBK6sirBDiex58K74p_bINdcf_OkhGw8mjyaIuaGt3Tn3gjI-eAFWXxvc3sv5KwTHK02diltF37dgMfqf1KJOd2MUG_AQonPtfNG5-vg_rI-g_boSXy8__hzgemHjmZkDFpZmw1DJCJWMUMtYRVWHegzUVbV51epfl9Y7NE8jM-u58YfSpnuLPfmec8zxiN4W2jAHANzUwG2jFZnuoM0iuQY2unYXtedwxCWOOEswxzmOqljiiDWOWOOISxz30O1lq3fRtkzaCysGujWxfC48L6EkTtxGrI6YDrV5kHvcSGI7nMskclnU4HbsRrETiAQ4RiIj5hLYLGzh7qNKmqXyAGE7YIK7EbwmLmEJ4cwVJJKUx0Iy4YsaOi8GJhzp6Cbhz2DUkFeMXWj4meZdIUyFX_47XKqVI7ReztFjVJmMX-UJWovfYMzHp2YefAAy9lzQ
linkProvider Elsevier
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Statistical+properties+of+a+class+of+randomized+binary+search+algorithms&rft.jtitle=Performance+evaluation&rft.au=Xia%2C+Ye&rft.date=2025-06-01&rft.issn=0166-5316&rft.volume=168&rft.spage=102478&rft_id=info:doi/10.1016%2Fj.peva.2025.102478&rft.externalDBID=n%2Fa&rft.externalDocID=10_1016_j_peva_2025_102478