TY - GEN
T1 - Automated delivery of Web documents through a caching infrastructure
AU - Rodriguez, Pablo
AU - Biersack, Ernst W.
AU - Ross, Keith W.
PY - 2003
Y1 - 2003
N2 - The dramatic growth of the Internet and of the Web traffic calls for scalable solutions to accessing Web documents. To this purpose, various caching schemes have been proposed and caching has been widely deployed. Since most Web documents change very rarely, the issue of consistency, i.e. how to assure access to the most recent version of a Web document, has received not much attention. However, as the number of frequently changing documents and the number of users accessing these documents increases, it becomes mandatory to propose scalable techniques that assure consistency. We look at one class of techniques that achieve consistency by performing automated delivery of Web documents. Among all schemes imaginable, automated delivery guarantees the lowest access latency for the clients. We compare pull- and push-based schemes for automated delivery and evaluate their performance analytically and via trace-driven simulation. We show that for both, pull- and push-based schemes, the use of a caching infrastructure is important to achieve scalability. For most documents in the Web, a pull distribution with a caching infrastructure can efficiently implement an automated delivery. However, when servers update their documents randomly and servers cannot ensure a minimum time-to-live interval during which documents remain unchanged, pull generates many requests to the origin server. For this case, we consider push-based schemes that use a caching infrastructure and we present a simple algorithm to determine which documents should be pushed given a limited available bandwidth.
AB - The dramatic growth of the Internet and of the Web traffic calls for scalable solutions to accessing Web documents. To this purpose, various caching schemes have been proposed and caching has been widely deployed. Since most Web documents change very rarely, the issue of consistency, i.e. how to assure access to the most recent version of a Web document, has received not much attention. However, as the number of frequently changing documents and the number of users accessing these documents increases, it becomes mandatory to propose scalable techniques that assure consistency. We look at one class of techniques that achieve consistency by performing automated delivery of Web documents. Among all schemes imaginable, automated delivery guarantees the lowest access latency for the clients. We compare pull- and push-based schemes for automated delivery and evaluate their performance analytically and via trace-driven simulation. We show that for both, pull- and push-based schemes, the use of a caching infrastructure is important to achieve scalability. For most documents in the Web, a pull distribution with a caching infrastructure can efficiently implement an automated delivery. However, when servers update their documents randomly and servers cannot ensure a minimum time-to-live interval during which documents remain unchanged, pull generates many requests to the origin server. For this case, we consider push-based schemes that use a caching infrastructure and we present a simple algorithm to determine which documents should be pushed given a limited available bandwidth.
UR - http://www.scopus.com/inward/record.url?scp=84889589974&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84889589974&partnerID=8YFLogxK
U2 - 10.1109/EURMIC.2003.1231595
DO - 10.1109/EURMIC.2003.1231595
M3 - Conference contribution
AN - SCOPUS:84889589974
SN - 0769519962
SN - 9780769519968
T3 - Conference Proceedings of the EUROMICRO
SP - 233
EP - 240
BT - Proceedings - 29th EUROMICRO Conference, EUROMICRO 2003
T2 - 29th EUROMICRO Conference, EUROMICRO 2003
Y2 - 1 September 2003 through 6 September 2003
ER -