Context Navigation

lossmodel.lyx

Visit:

Last change on this file was 3782c27, checked in by Shawn Willden <shawn-tahoe@…>, at 2009-07-28T22:44:30Z

Lossmodel updates

Various improvements to the lossmodel, plus addition of README.lossmodel
that provides a link to the PDF.

Property mode set to 100644

File size: 55.8 KB

Line
1	#LyX 1.6.2 created this file. For more info see http://www.lyx.org/
2	\lyxformat 345
3	\begin_document
4	\begin_header
5	\textclass amsart
6	\use_default_options true
7	\begin_modules
8	theorems-ams
9	theorems-ams-extended
10	\end_modules
11	\language english
12	\inputencoding auto
13	\font_roman default
14	\font_sans default
15	\font_typewriter default
16	\font_default_family default
17	\font_sc false
18	\font_osf false
19	\font_sf_scale 100
20	\font_tt_scale 100
21
22	\graphics default
23	\float_placement h
24	\paperfontsize default
25	\spacing single
26	\use_hyperref false
27	\papersize default
28	\use_geometry false
29	\use_amsmath 1
30	\use_esint 1
31	\cite_engine basic
32	\use_bibtopic false
33	\paperorientation portrait
34	\secnumdepth 3
35	\tocdepth 3
36	\paragraph_separation indent
37	\defskip medskip
38	\quotes_language english
39	\papercolumns 1
40	\papersides 1
41	\paperpagestyle default
42	\tracking_changes false
43	\output_changes false
44	\author ""
45	\author ""
46	\end_header
47
48	\begin_body
49
50	\begin_layout Title
51	Tahoe Distributed Filesharing System Loss Model
52	\end_layout
53
54	\begin_layout Author
55	Shawn Willden
56	\end_layout
57
58	\begin_layout Date
59	07/22/2009
60	\end_layout
61
62	\begin_layout Address
63	South Weber, Utah
64	\end_layout
65
66	\begin_layout Email
67	shawn@willden.org
68	\end_layout
69
70	\begin_layout Abstract
71	The abstract goes here
72	\end_layout
73
74	\begin_layout Section
75	Problem Statement
76	\end_layout
77
78	\begin_layout Standard
79	The allmydata Tahoe distributed file system uses Reed-Solomon erasure coding
80	to split files into
81	\begin_inset Formula $N$
82	\end_inset
83
84	shares which are delivered to randomly-selected peers in a distributed
85	network.
86	The file can later be reassembled from any
87	\begin_inset Formula $k\leq N$
88	\end_inset
89
90	of the shares, if they are available.
91	\end_layout
92
93	\begin_layout Standard
94	Over time shares are lost for a variety of reasons.
95	Storage servers may crash, be destroyed or simply be removed from the network.
96	To mitigate such losses, Tahoe network clients employ a repair agent which
97	scans the peers once per time period
98	\begin_inset Formula $A$
99	\end_inset
100
101	and determines how many of the shares remain.
102	If less than
103	\begin_inset Formula $L$
104	\end_inset
105
106	(
107	\begin_inset Formula $k\leq L\leq N$
108	\end_inset
109
110	) shares remain, then the repairer reconstructs the file shares and redistribute
111	s the missing ones, bringing the availability back up to full.
112	\end_layout
113
114	\begin_layout Standard
115	The question we're trying to answer is "What is the probability that we'll
116	be able to reassemble the file at some later time
117	\begin_inset Formula $T$
118	\end_inset
119
120	?".
121	We'd also like to be able to determine what values we should choose for
122
123	\begin_inset Formula $k$
124	\end_inset
125
126	,
127	\begin_inset Formula $N$
128	\end_inset
129
130	,
131	\begin_inset Formula $A$
132	\end_inset
133
134	, and
135	\begin_inset Formula $L$
136	\end_inset
137
138	in order to ensure
139	\begin_inset Formula $Pr[loss]\leq r$
140	\end_inset
141
142	for some threshold probability
143	\begin_inset Formula $r$
144	\end_inset
145
146	.
147	This is an optimization problem because although we could obtain very low
148
149	\begin_inset Formula $Pr[loss]$
150	\end_inset
151
152	by selecting conservative parameters, these choices have costs.
153	The peer storage and bandwidth consumed by the share distribution process
154	are approximately
155	\begin_inset Formula $\nicefrac{N}{k}$
156	\end_inset
157
158	times the size of the original file, so we would like to minimize
159	\begin_inset Formula $\nicefrac{N}{k}$
160	\end_inset
161
162	, consistent with
163	\begin_inset Formula $Pr[loss]\leq r$
164	\end_inset
165
166	.
167	Likewise, a frequent and aggressive repair process keeps the number of
168	shares available close to
169	\begin_inset Formula $N,$
170	\end_inset
171
172	but at a cost in bandwidth and processing time as the repair agent downloads
173
174	\begin_inset Formula $k$
175	\end_inset
176
177	shares, reconstructs the file and uploads new shares to replace those that
178	are lost.
179	\end_layout
180
181	\begin_layout Section
182	Reliability
183	\end_layout
184
185	\begin_layout Standard
186	The probability that the file becomes unrecoverable is dependent upon the
187	probability that the peers to whom we send shares are able to return those
188	copies on demand.
189	Shares that are corrupted are detected and discarded, so there is no need
190	to distinguish between corruption and loss.
191	\end_layout
192
193	\begin_layout Standard
194	Many factors affect share availability.
195	Availability can be temporarily interrupted by peer unavailability due
196	to network outages, power failures or administrative shutdown, among other
197	reasons.
198	Availability can be permanently lost due to failure or corruption of storage
199	media, catastrophic damage to the peer system, administrative error, withdrawal
200	from the network, malicious corruption, etc.
201	\end_layout
202
203	\begin_layout Standard
204	The existence of intermittent failure modes motivates the introduction of
205	a distinction between
206	\noun on
207	availability
208	\noun default
209	and
210	\noun on
211	reliability
212	\noun default
213	.
214	Reliability is the probability that a share is retrievable assuming intermitten
215	t failures can be waited out, so reliability considers only permanent failures.
216	Availability considers all failures, and is focused on the probability
217	of retrieval within some defined time frame.
218	\end_layout
219
220	\begin_layout Standard
221	Another consideration is that some failures affect multiple shares.
222	If multiple shares of a file are stored on a single hard drive, for example,
223	failure of that drive may lose them all.
224	Catastrophic damage to a data center may destroy all shares on all peers
225	in that data center.
226	\end_layout
227
228	\begin_layout Standard
229	While the types of failures that may occur are quite consistent across peers,
230	their probabilities differ dramatically.
231	A professionally-administered server with redundant storage, power and
232	Internet located in a carefully-monitored data center with automatic fire
233	suppression systems is much less likely to become either temporarily or
234	permanently unavailable than the typical virus and malware-ridden home
235	computer on a single cable modem connection.
236	A variety of situations in between exist as well, such as the case of the
237	author's home file server, which is administered by an IT professional
238	and uses RAID level 6 redundant storage, but runs on old, cobbled-together
239	equipment, and has a consumer-grade Internet connection.
240	\end_layout
241
242	\begin_layout Standard
243	To begin with, let's use a simple definition of reliability:
244	\end_layout
245
246	\begin_layout Definition
247
248	\noun on
249	Reliability
250	\noun default
251	is the probability
252	\begin_inset Formula $p_{i}$
253	\end_inset
254
255	that a share
256	\begin_inset Formula $s_{i}$
257	\end_inset
258
259	will survive to (be retrievable at) time
260	\begin_inset Formula $T=A$
261	\end_inset
262
263	, ignoring intermittent failures.
264	That is, the probability that the share will be retrievable at the end
265	of the current repair cycle, and therefore usable by the repairer to regenerate
266	any lost shares.
267	\end_layout
268
269	\begin_layout Standard
270	Reliability
271	\begin_inset Formula $p_{i}$
272	\end_inset
273
274	is clearly dependent on
275	\begin_inset Formula $A$
276	\end_inset
277
278	.
279	Short repair cycles offer less time for shares to
280	\begin_inset Quotes eld
281	\end_inset
282
283	decay
284	\begin_inset Quotes erd
285	\end_inset
286
287	into unavailability.
288	\end_layout
289
290	\begin_layout Subsection
291	Peer Reliability
292	\end_layout
293
294	\begin_layout Standard
295	Since peer reliability is the basis for any computations we may do on share
296	and file reliability, we must have a way to estimate it.
297	Reliability modeling of hardware, software and human performance are each
298	complex topics, the subject of much ongoing research.
299	In particular, the reliability of one of the key components of any peer
300	from our perspective -- the hard drive where file shares are stored --
301	is the subject of much current debate.
302	\end_layout
303
304	\begin_layout Standard
305	A common assumption about hardware failure is that it follows the
306	\begin_inset Quotes eld
307	\end_inset
308
309	bathtub curve
310	\begin_inset Quotes erd
311	\end_inset
312
313	, with frequent failures during the first few months, a constant failure
314	rate for a few years and then a rising failure rate as the hardware wears
315	out.
316	This curve is often flattened by burn-in stress testing, and by periodic
317	replacement that assures that in-service components never reach
318	\begin_inset Quotes eld
319	\end_inset
320
321	old age
322	\begin_inset Quotes erd
323	\end_inset
324
325	.
326	\end_layout
327
328	\begin_layout Standard
329	In any case, we're generally going to ignore all of that complexity and
330	focus on the bottom of the bathtub, assuming constant failure rates.
331	This is a particularly reasonable assumption as long as we're focused on
332	failures during a particular, relatively short interval
333	\begin_inset Formula $A$
334	\end_inset
335
336	.
337	Towards the end of this paper, as we examine failures over many repair
338	intervals, the assumption becomes more tenuous, and we note some of the
339	issues.
340	\end_layout
341
342	\begin_layout Subsubsection
343	Estimate Adaptation
344	\end_layout
345
346	\begin_layout Standard
347	Even assuming constant failure rates, however, it will be rare that the
348	duration of
349	\begin_inset Formula $A$
350	\end_inset
351
352	coincides with the available failure rate data, particularly since we want
353	to view
354	\begin_inset Formula $A$
355	\end_inset
356
357	as a tunable parameter.
358	It's necessary to be able adapt failure rates baselined against any given
359	duration to the selected value of
360	\begin_inset Formula $A$
361	\end_inset
362
363	.
364	\end_layout
365
366	\begin_layout Standard
367	Another issue is that failure rates of hardware, etc., are necessarily continuous
368	in nature, while the per-interval failure/survival rates that are of interest
369	for file reliability calculations are discrete -- a peer either survives
370	or fails during the interval.
371	The continuous nature of failure rates means that the common and obvious
372	methods for estimating failure rates result in values that follow continuous,
373	not discrete distributions.
374	The difference is minor for small failure probabilities, and converges
375	to zero as the number of intervals goes to infinity, but is important enough
376	in some cases to be worth correcting for.
377	\end_layout
378
379	\begin_layout Standard
380	Continuous failure rates are described in terms of mean time to failure,
381	and under the assumption that failure rates are constant, are exponentially
382	distributed.
383	Under these assumptions, the probability that a machine fails at time
384	\begin_inset Formula $t$
385	\end_inset
386
387	, is
388	\begin_inset Formula \[
389	f\left(t\right)=\lambda e^{-\lambda t}\]
390
391	\end_inset
392
393	where
394	\begin_inset Formula $\lambda$
395	\end_inset
396
397	represents the per unit-time failure rate.
398	The probability that a machine fails at or before time
399	\begin_inset Formula $A$
400	\end_inset
401
402	is therefore
403	\begin_inset Formula \begin{align}
404	F\left(t\right) & =\int_{0}^{A}f\left(x\right)dx\nonumber \\
405	& =\int_{0}^{A}\lambda e^{-\lambda x}dx\nonumber \\
406	& =1-e^{-\lambda A}\label{eq:failure-time}\end{align}
407
408	\end_inset
409
410
411	\end_layout
412
413	\begin_layout Standard
414	Note that
415	\begin_inset Formula $A$
416	\end_inset
417
418	and
419	\begin_inset Formula $\lambda$
420	\end_inset
421
422	in
423	\begin_inset CommandInset ref
424	LatexCommand ref
425	reference "eq:failure-time"
426
427	\end_inset
428
429	must be expressed in consistent time units.
430	If they're different, unit conversions should be applied in the normal
431	way.
432	For example, if the estimate for
433	\begin_inset Formula $\lambda$
434	\end_inset
435
436	is 750 failures per million hours, and
437	\begin_inset Formula $A$
438	\end_inset
439
440	is one month, then either
441	\begin_inset Formula $A$
442	\end_inset
443
444	should be represented as
445	\begin_inset Formula $30\cdot24/1000000=.00072$
446	\end_inset
447
448	, or
449	\begin_inset Formula $\lambda$
450	\end_inset
451
452	should be converted to failures per month.
453	Or both may be converted to hours.
454	\end_layout
455
456	\begin_layout Subsubsection
457	Acquiring Peer Reliability Estimates
458	\end_layout
459
460	\begin_layout Standard
461	Need to write this.
462	\end_layout
463
464	\begin_layout Subsection
465	Uniform Reliability
466	\begin_inset CommandInset label
467	LatexCommand label
468	name "sub:Fixed-Reliability"
469
470	\end_inset
471
472
473	\end_layout
474
475	\begin_layout Standard
476	In the simplest case, the peers holding the file shares all have the same
477	reliability
478	\begin_inset Formula $p$
479	\end_inset
480
481	, and are all independent from one another.
482	Let
483	\begin_inset Formula $K$
484	\end_inset
485
486	be a random variable that represents the number of shares that survive
487
488	\begin_inset Formula $A$
489	\end_inset
490
491	.
492	Each share's survival can be viewed as an independent Bernoulli trial with
493	a success probability of
494	\begin_inset Formula $p$
495	\end_inset
496
497	, which means that
498	\begin_inset Formula $K$
499	\end_inset
500
501	follows the binomial distribution with parameters
502	\begin_inset Formula $N$
503	\end_inset
504
505	and
506	\begin_inset Formula $p$
507	\end_inset
508
509	.
510	That is,
511	\begin_inset Formula $K\sim B(N,p)$
512	\end_inset
513
514	.
515	\end_layout
516
517	\begin_layout Theorem
518	Binomial Distribution Theorem
519	\end_layout
520
521	\begin_layout Theorem
522	Consider
523	\begin_inset Formula $n$
524	\end_inset
525
526	independent Bernoulli trials
527	\begin_inset Foot
528	status collapsed
529
530	\begin_layout Plain Layout
531	A Bernoulli trial is simply a test of some sort that results in one of two
532	outcomes, one of which is designated success and the other failure.
533	The classic example of a Bernoulli trial is a coin toss.
534	\end_layout
535
536	\end_inset
537
538	that succeed with probability
539	\begin_inset Formula $p$
540	\end_inset
541
542	, and let
543	\begin_inset Formula $K$
544	\end_inset
545
546	be a random variable that represents the number,
547	\begin_inset Formula $m$
548	\end_inset
549
550	, of successes,
551	\begin_inset Formula $0\le m\le n$
552	\end_inset
553
554	.
555	We say that
556	\begin_inset Formula $K$
557	\end_inset
558
559	follows the Binomial Distribution with parameters n and p, denoted
560	\begin_inset Formula $K\sim B(n,p)$
561	\end_inset
562
563	.
564	The probability mass function (PMF) of K is a function that gives the probabili
565	ty that
566	\begin_inset Formula $K$
567	\end_inset
568
569	takes a particular value
570	\begin_inset Formula $m$
571	\end_inset
572
573	(the probability that there are exactly
574	\begin_inset Formula $m$
575	\end_inset
576
577	successful trials, and therefore
578	\begin_inset Formula $n-m$
579	\end_inset
580
581	failures).
582	The PMF of K is
583	\begin_inset Formula \begin{equation}
584	Pr[K=m]=f(m;n,p)=\binom{n}{m}p^{m}(1-p)^{n-m}\label{eq:binomial-pmf}\end{equation}
585
586	\end_inset
587
588
589	\end_layout
590
591	\begin_layout Proof
592	Consider the specific case of exactly
593	\begin_inset Formula $m$
594	\end_inset
595
596	successes followed by
597	\begin_inset Formula $n-m$
598	\end_inset
599
600	failures, because each success has probability
601	\begin_inset Formula $p$
602	\end_inset
603
604	, each failure has probability
605	\begin_inset Formula $1-p$
606	\end_inset
607
608	, and the trials are independent, the probability of this exact case occurring
609	is
610	\begin_inset Formula $p^{m}\left(1-p\right)^{\left(n-m\right)}$
611	\end_inset
612
613	, the product of the probabilities of the outcome of each trial.
614	\end_layout
615
616	\begin_layout Proof
617	Now consider any reordering of these
618	\begin_inset Formula $m$
619	\end_inset
620
621	successes and
622	\begin_inset Formula $n$
623	\end_inset
624
625	failures.
626	Any such reordering occurs with the same probability
627	\begin_inset Formula $p^{m}\left(1-p\right)^{\left(n-m\right)}$
628	\end_inset
629
630	, but with the terms of the product reordered.
631	Since multiplication is commutative, each such reordering has the same
632	probability.
633	There are n-choose-m such orderings, and each ordering is an independent
634	event, meaning we can sum the probabilities of the individual orderings,
635	so the probability that any ordering of
636	\begin_inset Formula $m$
637	\end_inset
638
639	successes and
640	\begin_inset Formula $n-m$
641	\end_inset
642
643	failures occurs is given by
644	\begin_inset Formula \[
645	\binom{n}{m}p^{m}\left(1-p\right)^{\left(n-m\right)}\]
646
647	\end_inset
648
649	which is the right-hand-side of equation
650	\begin_inset CommandInset ref
651	LatexCommand ref
652	reference "eq:binomial-pmf"
653
654	\end_inset
655
656	.
657	\end_layout
658
659	\begin_layout Standard
660	A file survives if at least
661	\begin_inset Formula $k$
662	\end_inset
663
664	of the
665	\begin_inset Formula $N$
666	\end_inset
667
668	shares survive.
669	Equation
670	\begin_inset CommandInset ref
671	LatexCommand ref
672	reference "eq:binomial-pmf"
673
674	\end_inset
675
676	gives the probability that exactly
677	\begin_inset Formula $i$
678	\end_inset
679
680	shares survive, for any
681	\begin_inset Formula $1\leq i\leq n$
682	\end_inset
683
684	, so the probability that fewer than
685	\begin_inset Formula $k$
686	\end_inset
687
688	survive is the sum of the probabilities that
689	\begin_inset Formula $0,1,2,\ldots,k-1$
690	\end_inset
691
692	shares survive.
693	That is:
694	\end_layout
695
696	\begin_layout Standard
697	\begin_inset Formula \begin{equation}
698	Pr[file\, lost]=\sum_{i=0}^{k-1}\binom{n}{i}p^{i}(1-p)^{n-i}\label{eq:simple-failure}\end{equation}
699
700	\end_inset
701
702
703	\end_layout
704
705	\begin_layout Subsection
706	Independent Reliability
707	\begin_inset CommandInset label
708	LatexCommand label
709	name "sub:Independent-Reliability"
710
711	\end_inset
712
713
714	\end_layout
715
716	\begin_layout Standard
717	Equation
718	\begin_inset CommandInset ref
719	LatexCommand ref
720	reference "eq:simple-failure"
721
722	\end_inset
723
724	assumes that all shares have the same probability of survival, but as explained
725	above, this is not necessarily true.
726	A more accurate model allows each share
727	\begin_inset Formula $s_{i}$
728	\end_inset
729
730	an independent probability of survival
731	\begin_inset Formula $p_{i}$
732	\end_inset
733
734	.
735	Each share's survival can still be treated as an independent Bernoulli
736	trial, but with success probability
737	\begin_inset Formula $p_{i}$
738	\end_inset
739
740	.
741	Under this assumption,
742	\begin_inset Formula $K$
743	\end_inset
744
745	follows a generalized binomial distribution with parameters
746	\begin_inset Formula $N$
747	\end_inset
748
749	and
750	\begin_inset Formula $p_{1},p_{2},\dots,p_{N}$
751	\end_inset
752
753	.
754	\end_layout
755
756	\begin_layout Standard
757	The PMF for this generalized
758	\begin_inset Formula $K$
759	\end_inset
760
761	does not have a simple closed-form representation.
762	However, the PMFs for random variables representing individual share survival
763	do.
764	Let
765	\begin_inset Formula $K_{i}$
766	\end_inset
767
768	be a random variable such that:
769	\end_layout
770
771	\begin_layout Standard
772	\begin_inset Formula \[
773	K_{i}=\begin{cases}
774	1 & \textnormal{if }s_{i}\textnormal{ survives}\\
775	0 & \textnormal{if }s_{i}\textnormal{ fails}\end{cases}\]
776
777	\end_inset
778
779
780	\end_layout
781
782	\begin_layout Standard
783	The PMF for
784	\begin_inset Formula $K_{i}$
785	\end_inset
786
787	is very simple:
788	\begin_inset Formula \[
789	Pr[K_{i}=j]=\begin{cases}
790	p_{i} & j=1\\
791	1-p_{i} & j=0\end{cases}\]
792
793	\end_inset
794
795	which can also be expressed as
796	\begin_inset Formula \[
797	Pr[K_{i}=j]=f\left(j\right)=\left(1-p_{i}\right)\left(1-j\right)+p_{i}\left(j\right)\]
798
799	\end_inset
800
801
802	\end_layout
803
804	\begin_layout Standard
805	Note that since each
806	\begin_inset Formula $K_{i}$
807	\end_inset
808
809	represents the count of shares
810	\begin_inset Formula $s_{i}$
811	\end_inset
812
813	that survives (either 0 or 1), if we add up all of the individual survivor
814	counts, we get the group survivor count.
815	That is:
816	\begin_inset Formula \[
817	\sum_{i=1}^{N}K_{i}=K\]
818
819	\end_inset
820
821	Effectively, we have separated
822	\begin_inset Formula $K$
823	\end_inset
824
825	into the series of Bernoulli trials that make it up.
826	\end_layout
827
828	\begin_layout Theorem
829	Discrete Convolution Theorem
830	\end_layout
831
832	\begin_layout Theorem
833	Let
834	\begin_inset Formula $X$
835	\end_inset
836
837	and
838	\begin_inset Formula $Y$
839	\end_inset
840
841	be discrete random variables with probability mass functions given by
842	\begin_inset Formula $Pr\left[X=x\right]=f(x)$
843	\end_inset
844
845	and
846	\begin_inset Formula $Pr\left[Y=y\right]=g(y).$
847	\end_inset
848
849	Let
850	\begin_inset Formula $Z$
851	\end_inset
852
853	be the discrete random random variable obtained by summing
854	\begin_inset Formula $X$
855	\end_inset
856
857	and
858	\begin_inset Formula $Y$
859	\end_inset
860
861	.
862	\end_layout
863
864	\begin_layout Theorem
865	The probability mass function of
866	\begin_inset Formula $Z$
867	\end_inset
868
869	is given by
870	\begin_inset Formula \[
871	Pr[Z=z]=h(z)=\left(f\star g\right)(z)\]
872
873	\end_inset
874
875	where
876	\begin_inset Formula $\star$
877	\end_inset
878
879	denotes the discrete convolution operation:
880	\begin_inset Formula \[
881	\left(f\star g\right)\left(n\right)=\sum_{m=-\infty}^{\infty}f\left(m\right)g\left(m-n\right)\]
882
883	\end_inset
884
885
886	\end_layout
887
888	\begin_layout Proof
889	The proof is beyond the scope of this paper.
890	\end_layout
891
892	\begin_layout Standard
893	If we denote the PMF of
894	\begin_inset Formula $K$
895	\end_inset
896
897	with
898	\begin_inset Formula $f$
899	\end_inset
900
901	and the PMF of
902	\begin_inset Formula $K_{i}$
903	\end_inset
904
905	with
906	\begin_inset Formula $g_{i}$
907	\end_inset
908
909	(more formally,
910	\begin_inset Formula $Pr[K=x]=f(x)$
911	\end_inset
912
913	and
914	\begin_inset Formula $Pr[K_{i}=x]=g_{i}(x)$
915	\end_inset
916
917	) then since
918	\begin_inset Formula $K=\sum_{i=1}^{N}K_{i}$
919	\end_inset
920
921	, according to the discrete convolution theorem
922	\begin_inset Formula $f=g_{1}\star g_{2}\star g_{3}\star\ldots\star g_{N}$
923	\end_inset
924
925	.
926	Since convolution is associative, this can also be written as
927	\begin_inset Formula $ $
928	\end_inset
929
930
931	\begin_inset Formula \begin{equation}
932	f=(\ldots((g_{1}\star g_{2})\star g_{3})\star\ldots)\star g_{N})\label{eq:convolution}\end{equation}
933
934	\end_inset
935
936	Therefore,
937	\begin_inset Formula $f$
938	\end_inset
939
940	can be computed as a sequence of convolution operations on the simple PMFs
941	of the random variables
942	\begin_inset Formula $K_{i}$
943	\end_inset
944
945	.
946	In fact, for large
947	\begin_inset Formula $N$
948	\end_inset
949
950	, equation
951	\begin_inset CommandInset ref
952	LatexCommand ref
953	reference "eq:convolution"
954
955	\end_inset
956
957	turns out to be a more effective means of computing the PMF of
958	\begin_inset Formula $K$
959	\end_inset
960
961	than the binomial theorem.
962	even in the case of shares with identical survival probability.
963	The reason it's better is because the calculation of
964	\begin_inset Formula $\binom{n}{m}$
965	\end_inset
966
967	in equation
968	\begin_inset CommandInset ref
969	LatexCommand ref
970	reference "eq:binomial-pmf"
971
972	\end_inset
973
974	produces very large values that overflow unless arbitrary precision numeric
975	representations are used.
976	\end_layout
977
978	\begin_layout Standard
979	Note also that it is not necessary to have very simple PMFs like those of
980	the
981	\begin_inset Formula $K_{i}$
982	\end_inset
983
984	.
985	Any share or set of shares that has a known PMF can be combined with any
986	other set with a known PMF by convolution, as long as the two share sets
987	are independent.
988	The reverse holds as well; given a group with an empirically-derived PMF,
989	in it's theoretically possible to solve for an individual PMF, and thereby
990	determine
991	\begin_inset Formula $p_{i}$
992	\end_inset
993
994	even when per-share data is unavailable.
995	\end_layout
996
997	\begin_layout Subsection
998	Multiple Failure Modes
999	\begin_inset CommandInset label
1000	LatexCommand label
1001	name "sub:Multiple-Failure-Modes"
1002
1003	\end_inset
1004
1005
1006	\end_layout
1007
1008	\begin_layout Standard
1009	In modeling share survival probabilities, it's useful to be able to analyze
1010	separately each of the various failure modes.
1011	For example, if reliable statistics for disk failure can be obtained, then
1012	a probability mass function for that form of failure can be generated.
1013	Similarly, statistics on other hardware failures, administrative errors,
1014	network losses, etc., can all be estimated independently.
1015	If those estimates can then be combined into a single PMF for a share,
1016	then we can use it to predict failures for that share.
1017	\end_layout
1018
1019	\begin_layout Standard
1020	Combining independent failure modes for a single share is straightforward.
1021	If
1022	\begin_inset Formula $p_{i,j}$
1023	\end_inset
1024
1025	is the probability of survival of the
1026	\begin_inset Formula $j$
1027	\end_inset
1028
1029	th failure mode of share
1030	\begin_inset Formula $i$
1031	\end_inset
1032
1033	,
1034	\begin_inset Formula $1\leq j\leq m$
1035	\end_inset
1036
1037	, then
1038	\begin_inset Formula \[
1039	Pr[K_{i}=k]=f_{i}(k)=\begin{cases}
1040	\prod_{j=1}^{m}p_{i,j} & k=1\\
1041	1-\prod_{j=1}^{m}p_{i,j} & k=0\end{cases}\]
1042
1043	\end_inset
1044
1045	is the survival PMF.
1046	\end_layout
1047
1048	\begin_layout Subsection
1049	Multi-share failures
1050	\begin_inset CommandInset label
1051	LatexCommand label
1052	name "sub:Multi-share-failures"
1053
1054	\end_inset
1055
1056
1057	\end_layout
1058
1059	\begin_layout Standard
1060	If there are failure modes that affect multiple computers, we can also construct
1061	the PMF that predicts their survival.
1062	The key observation is that the PMF has non-zero probabilities only for
1063
1064	\begin_inset Formula $0$
1065	\end_inset
1066
1067	survivors and
1068	\begin_inset Formula $n$
1069	\end_inset
1070
1071	survivors, where
1072	\begin_inset Formula $n$
1073	\end_inset
1074
1075	is the number of shares in the set.
1076	If
1077	\begin_inset Formula $p$
1078	\end_inset
1079
1080	is the probability of survival, the PMF of
1081	\begin_inset Formula $K$
1082	\end_inset
1083
1084	, a random variable representing the number of survivors is
1085	\begin_inset Formula \[
1086	Pr[K=k]=f(k)=\begin{cases}
1087	p & k=n\\
1088	0 & 0<i<n\\
1089	1-p & k=0\end{cases}\]
1090
1091	\end_inset
1092
1093
1094	\end_layout
1095
1096	\begin_layout Standard
1097	Group failures due to multiple independent causes can be combined as in
1098	section
1099	\begin_inset CommandInset ref
1100	LatexCommand ref
1101	reference "sub:Multiple-Failure-Modes"
1102
1103	\end_inset
1104
1105	, as long as they apply to the whole group.
1106	\end_layout
1107
1108	\begin_layout Example
1109	Putting the Pieces Together
1110	\end_layout
1111
1112	\begin_layout Standard
1113	Sections
1114	\begin_inset CommandInset ref
1115	LatexCommand ref
1116	reference "sub:Fixed-Reliability"
1117
1118	\end_inset
1119
1120	through
1121	\begin_inset CommandInset ref
1122	LatexCommand ref
1123	reference "sub:Multi-share-failures"
1124
1125	\end_inset
1126
1127	provide ways of calculating the survival probability mass functions for
1128	a variety of share failure structures and modes.
1129	As an example of how these pieces can be used, consider a network with
1130	the following peers:
1131	\end_layout
1132
1133	\begin_layout Itemize
1134	Four servers located in a data center in Nebraska.
1135	The machines have multiply-redundant Internet connections, with a failure
1136	probability of 0.0001.
1137	They store their shares on RAID arrays with failure probability of 0.0002.
1138	The administrative staff makes data-destroying errors with probability
1139	0.003.
1140	\end_layout
1141
1142	\begin_layout Itemize
1143	Four servers located in a data center on the island of Hawaii.
1144	These servers have identical failure probabilities as the servers in Nebraska,
1145	except that the data center is near the edge of the crater on Mount Kilauea
1146	(nobody said examples had to be realistic).
1147	There is a 0.04 chance that the volcano will erupt and bury the data center
1148	in molten lava, destroying it entirely.
1149	\end_layout
1150
1151	\begin_layout Itemize
1152	Four PCs located in random homes, connected to the Internet via assorted
1153	cable modems and DSL.
1154	Their network connections fail with probability 0.009.
1155	Their disks fail with probability 0.001.
1156	Their users destroy data with probability 0.05.
1157	\end_layout
1158
1159	\begin_layout Standard
1160	If one share is placed on each of these 12 computers, what's the probability
1161	mass function of share survival? To more compactly describe PMFs, we'll
1162	denote them as probability vectors of the form
1163	\begin_inset Formula $\left[\alpha_{o},\alpha_{1},\alpha_{2},\ldots\alpha_{n}\right]$
1164	\end_inset
1165
1166	where
1167	\begin_inset Formula $\alpha_{i}$
1168	\end_inset
1169
1170	is the probability that exactly
1171	\begin_inset Formula $i$
1172	\end_inset
1173
1174	shares survive.
1175	\end_layout
1176
1177	\begin_layout Standard
1178	The servers in the two data centers have individual failure probabilities
1179	of RAID failure (.0002) and administrative error (.003) giving an individual
1180	survival probability of
1181	\begin_inset Formula \[
1182	(1-.0002)\cdot(1-.003)=.9998\cdot.997=.9968\]
1183
1184	\end_inset
1185
1186
1187	\end_layout
1188
1189	\begin_layout Standard
1190	Using
1191	\begin_inset Formula $p=.9968,n=4$
1192	\end_inset
1193
1194	in equation
1195	\begin_inset CommandInset ref
1196	LatexCommand ref
1197	reference "eq:binomial-pmf"
1198
1199	\end_inset
1200
1201	gives the survival PMF
1202	\begin_inset Formula \[
1203	\left[1.049\times10^{-10},1.307\times10^{-7},6.105\times10^{-5},0.01271,0.9872\right]\]
1204
1205	\end_inset
1206
1207	which applies to each group of four servers.
1208	However, each data center also has a .0001 chance of data connection loss,
1209	which affects all four servers at once, and Hawaii has the additional .04
1210	probability of severe lava burn.
1211	If the network fails at a location, all the machines go offline together.
1212	The probability that 0 machines survive is the probability that they all
1213	fail for individual reasons (
1214	\begin_inset Formula $1.049\cdot10^{-10}$
1215	\end_inset
1216
1217	) plus the probability they all fail because of a network outage (
1218	\begin_inset Formula $.0001$
1219	\end_inset
1220
1221	) less the probability they fail for both reasons:
1222	\begin_inset Formula \[
1223	\left(1.049\times10^{-10}\right)+\left(0.0001\right)-\left[\left(1.049\times10^{-10}\right)\cdot\left(0.0001\right)\right]\approxeq0.0001\]
1224
1225	\end_inset
1226
1227
1228	\end_layout
1229
1230	\begin_layout Standard
1231	That's the
1232	\begin_inset Formula $i=0$
1233	\end_inset
1234
1235	element of the combined PMF.
1236	The combined probability of survival of
1237	\begin_inset Formula $0<i\leq4$
1238	\end_inset
1239
1240	servers is simpler: it's the probability they survive individual failure,
1241	from the individual failure PMF above, times the probability they survive
1242	network failure (.9999).
1243	So the combined survival PMF, which we'll denote as
1244	\begin_inset Formula $n(i)$
1245	\end_inset
1246
1247	of the Nebraska servers is
1248	\begin_inset Formula \[
1249	n(i)=\left[0.0001,1.306\times10^{-7},6.104\times10^{-5},0.01268,0.9872\right]\]
1250
1251	\end_inset
1252
1253	which has the interesting property that complete failure is 1000 times more
1254	likely than survival of one server.
1255	This is because the probability of a network outage is so much greater
1256	than simultaneous
1257	\begin_inset Foot
1258	status collapsed
1259
1260	\begin_layout Plain Layout
1261	Of course, the failures need not be truly simultaneous, they just have happen
1262	in the same interval between repair runs.
1263	\end_layout
1264
1265	\end_inset
1266
1267	independent failure of three servers.
1268	\end_layout
1269
1270	\begin_layout Standard
1271	We apply the same process for the Hawaii servers, but with group survival
1272	probability of
1273	\begin_inset Formula $(1-.0001)(1-.04)=.9799$
1274	\end_inset
1275
1276	gives the survival PMF
1277	\begin_inset Formula \[
1278	h(i)=\left[0.0201,1.280\times10^{-7},5.982\times10^{-5},0.01242,0.9674\right]\]
1279
1280	\end_inset
1281
1282
1283	\end_layout
1284
1285	\begin_layout Standard
1286	Applying the convolution operator to
1287	\begin_inset Formula $n(i)$
1288	\end_inset
1289
1290	and
1291	\begin_inset Formula $h(i)$
1292	\end_inset
1293
1294	, the survival PMF of all eight servers is:
1295	\end_layout
1296
1297	\begin_layout Standard
1298	\begin_inset Formula \[
1299	\left(n\star h\right)\left(i\right)=\begin{cases}
1300	2.010\times10^{-6} & i=0\\
1301	2.639\times10^{-9} & i=1\\
1302	1.233\times10^{-6} & i=2\\
1303	2.560\times10^{-4} & i=3\\
1304	0.01994 & i=4\\
1305	1.769\times10^{-6} & i=5\\
1306	2.756\times10^{-4} & i=6\\
1307	0.02452 & i=7\\
1308	0.9559 & i=8\end{cases}\]
1309
1310	\end_inset
1311
1312
1313	\end_layout
1314
1315	\begin_layout Standard
1316	\begin_inset VSpace defskip
1317	\end_inset
1318
1319
1320	\end_layout
1321
1322	\begin_layout Standard
1323	Note that losing four shares (
1324	\begin_inset Formula $i=4$
1325	\end_inset
1326
1327	) is 10,000 times more likely than losing three (
1328	\begin_inset Formula $i=5$
1329	\end_inset
1330
1331	).
1332	This is because both data centers have a whole-center failure mode, and
1333	the Hawaii center's lava burn probability is so high.
1334	Similarly, the probability of losing all of them is 1000 times higher than
1335	the probability of losing all but one.
1336	\end_layout
1337
1338	\begin_layout Standard
1339	For the home PCs, their individual probability of survival is
1340	\begin_inset Formula \[
1341	(1-.009)\cdot(1-.001)\cdot(1-.05)=.991\cdot.999\cdot.95=.9405\]
1342
1343	\end_inset
1344
1345
1346	\end_layout
1347
1348	\begin_layout Standard
1349	We can then apply equation
1350	\begin_inset CommandInset ref
1351	LatexCommand ref
1352	reference "eq:binomial-pmf"
1353
1354	\end_inset
1355
1356	with
1357	\begin_inset Formula $N=4$
1358	\end_inset
1359
1360	and
1361	\begin_inset Formula $p=.9405$
1362	\end_inset
1363
1364	to compute the PMF
1365	\begin_inset Formula $g(i),0\leq i\leq4$
1366	\end_inset
1367
1368	for the PCs and finally compute
1369	\begin_inset Formula $f(i)=\left(g\star\left(n\star h\right)\right)\left(i\right)$
1370	\end_inset
1371
1372	, the PMF of the whole share set.
1373	Summing the values of
1374	\begin_inset Formula $f(i)$
1375	\end_inset
1376
1377	for
1378	\begin_inset Formula $0\leq i\leq k-1$
1379	\end_inset
1380
1381	gives the probability that less than
1382	\begin_inset Formula $k$
1383	\end_inset
1384
1385	shares survive and the file is unrecoverable.
1386	For this example, those sums are shown in table
1387	\begin_inset CommandInset ref
1388	LatexCommand vref
1389	reference "tab:Example-PMF"
1390
1391	\end_inset
1392
1393	.
1394	\begin_inset Float table
1395	wide false
1396	sideways false
1397	status collapsed
1398
1399	\begin_layout Plain Layout
1400	\align center
1401	\begin_inset Tabular
1402	<lyxtabular version="3" rows="13" columns="4">
1403	<features>
1404	<column alignment="center" valignment="top" width="0">
1405	<column alignment="center" valignment="top" width="0">
1406	<column alignment="center" valignment="top" width="0">
1407	<column alignment="center" valignment="top" width="0">
1408	<row>
1409	<cell alignment="center" valignment="top" topline="true" bottomline="true" leftline="true" usebox="none">
1410	\begin_inset Text
1411
1412	\begin_layout Plain Layout
1413	\begin_inset Formula $k$
1414	\end_inset
1415
1416
1417	\end_layout
1418
1419	\end_inset
1420	</cell>
1421	<cell alignment="center" valignment="top" topline="true" bottomline="true" leftline="true" usebox="none">
1422	\begin_inset Text
1423
1424	\begin_layout Plain Layout
1425	\begin_inset Formula $Pr[K=k]$
1426	\end_inset
1427
1428
1429	\end_layout
1430
1431	\end_inset
1432	</cell>
1433	<cell alignment="center" valignment="top" topline="true" bottomline="true" leftline="true" usebox="none">
1434	\begin_inset Text
1435
1436	\begin_layout Plain Layout
1437	\begin_inset Formula $Pr[file\, loss]=Pr[K<k]$
1438	\end_inset
1439
1440
1441	\end_layout
1442
1443	\end_inset
1444	</cell>
1445	<cell alignment="center" valignment="top" topline="true" bottomline="true" leftline="true" rightline="true" usebox="none">
1446	\begin_inset Text
1447
1448	\begin_layout Plain Layout
1449	\begin_inset Formula $N/k$
1450	\end_inset
1451
1452
1453	\end_layout
1454
1455	\end_inset
1456	</cell>
1457	</row>
1458	<row>
1459	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1460	\begin_inset Text
1461
1462	\begin_layout Plain Layout
1463	1
1464	\end_layout
1465
1466	\end_inset
1467	</cell>
1468	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1469	\begin_inset Text
1470
1471	\begin_layout Plain Layout
1472	\begin_inset Formula $1.60\times10^{-9}$
1473	\end_inset
1474
1475
1476	\end_layout
1477
1478	\end_inset
1479	</cell>
1480	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1481	\begin_inset Text
1482
1483	\begin_layout Plain Layout
1484	\begin_inset Formula $2.53\times10^{-11}$
1485	\end_inset
1486
1487
1488	\end_layout
1489
1490	\end_inset
1491	</cell>
1492	<cell alignment="center" valignment="top" topline="true" leftline="true" rightline="true" usebox="none">
1493	\begin_inset Text
1494
1495	\begin_layout Plain Layout
1496	12
1497	\end_layout
1498
1499	\end_inset
1500	</cell>
1501	</row>
1502	<row>
1503	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1504	\begin_inset Text
1505
1506	\begin_layout Plain Layout
1507	2
1508	\end_layout
1509
1510	\end_inset
1511	</cell>
1512	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1513	\begin_inset Text
1514
1515	\begin_layout Plain Layout
1516	\begin_inset Formula $3.80\times10^{-8}$
1517	\end_inset
1518
1519
1520	\end_layout
1521
1522	\end_inset
1523	</cell>
1524	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1525	\begin_inset Text
1526
1527	\begin_layout Plain Layout
1528	\begin_inset Formula $1.63\times10^{-9}$
1529	\end_inset
1530
1531
1532	\end_layout
1533
1534	\end_inset
1535	</cell>
1536	<cell alignment="center" valignment="top" topline="true" leftline="true" rightline="true" usebox="none">
1537	\begin_inset Text
1538
1539	\begin_layout Plain Layout
1540	6
1541	\end_layout
1542
1543	\end_inset
1544	</cell>
1545	</row>
1546	<row>
1547	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1548	\begin_inset Text
1549
1550	\begin_layout Plain Layout
1551	3
1552	\end_layout
1553
1554	\end_inset
1555	</cell>
1556	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1557	\begin_inset Text
1558
1559	\begin_layout Plain Layout
1560	\begin_inset Formula $4.04\times10^{-7}$
1561	\end_inset
1562
1563
1564	\end_layout
1565
1566	\end_inset
1567	</cell>
1568	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1569	\begin_inset Text
1570
1571	\begin_layout Plain Layout
1572	\begin_inset Formula $3.70\times10^{-8}$
1573	\end_inset
1574
1575
1576	\end_layout
1577
1578	\end_inset
1579	</cell>
1580	<cell alignment="center" valignment="top" topline="true" leftline="true" rightline="true" usebox="none">
1581	\begin_inset Text
1582
1583	\begin_layout Plain Layout
1584	4
1585	\end_layout
1586
1587	\end_inset
1588	</cell>
1589	</row>
1590	<row>
1591	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1592	\begin_inset Text
1593
1594	\begin_layout Plain Layout
1595	4
1596	\end_layout
1597
1598	\end_inset
1599	</cell>
1600	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1601	\begin_inset Text
1602
1603	\begin_layout Plain Layout
1604	\begin_inset Formula $2.06\times10^{-6}$
1605	\end_inset
1606
1607
1608	\end_layout
1609
1610	\end_inset
1611	</cell>
1612	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1613	\begin_inset Text
1614
1615	\begin_layout Plain Layout
1616	\begin_inset Formula $4.44\times10^{-7}$
1617	\end_inset
1618
1619
1620	\end_layout
1621
1622	\end_inset
1623	</cell>
1624	<cell alignment="center" valignment="top" topline="true" leftline="true" rightline="true" usebox="none">
1625	\begin_inset Text
1626
1627	\begin_layout Plain Layout
1628	3
1629	\end_layout
1630
1631	\end_inset
1632	</cell>
1633	</row>
1634	<row>
1635	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1636	\begin_inset Text
1637
1638	\begin_layout Plain Layout
1639	5
1640	\end_layout
1641
1642	\end_inset
1643	</cell>
1644	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1645	\begin_inset Text
1646
1647	\begin_layout Plain Layout
1648	\begin_inset Formula $2.10\times10^{-5}$
1649	\end_inset
1650
1651
1652	\end_layout
1653
1654	\end_inset
1655	</cell>
1656	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1657	\begin_inset Text
1658
1659	\begin_layout Plain Layout
1660	\begin_inset Formula $2.50\times10^{-6}$
1661	\end_inset
1662
1663
1664	\end_layout
1665
1666	\end_inset
1667	</cell>
1668	<cell alignment="center" valignment="top" topline="true" leftline="true" rightline="true" usebox="none">
1669	\begin_inset Text
1670
1671	\begin_layout Plain Layout
1672	2.4
1673	\end_layout
1674
1675	\end_inset
1676	</cell>
1677	</row>
1678	<row>
1679	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1680	\begin_inset Text
1681
1682	\begin_layout Plain Layout
1683	6
1684	\end_layout
1685
1686	\end_inset
1687	</cell>
1688	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1689	\begin_inset Text
1690
1691	\begin_layout Plain Layout
1692	\begin_inset Formula $0.000428$
1693	\end_inset
1694
1695
1696	\end_layout
1697
1698	\end_inset
1699	</cell>
1700	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1701	\begin_inset Text
1702
1703	\begin_layout Plain Layout
1704	\begin_inset Formula $2.35\times10^{-5}$
1705	\end_inset
1706
1707
1708	\end_layout
1709
1710	\end_inset
1711	</cell>
1712	<cell alignment="center" valignment="top" topline="true" leftline="true" rightline="true" usebox="none">
1713	\begin_inset Text
1714
1715	\begin_layout Plain Layout
1716	2
1717	\end_layout
1718
1719	\end_inset
1720	</cell>
1721	</row>
1722	<row>
1723	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1724	\begin_inset Text
1725
1726	\begin_layout Plain Layout
1727	7
1728	\end_layout
1729
1730	\end_inset
1731	</cell>
1732	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1733	\begin_inset Text
1734
1735	\begin_layout Plain Layout
1736	\begin_inset Formula $0.00417$
1737	\end_inset
1738
1739
1740	\end_layout
1741
1742	\end_inset
1743	</cell>
1744	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1745	\begin_inset Text
1746
1747	\begin_layout Plain Layout
1748	\begin_inset Formula $0.000452$
1749	\end_inset
1750
1751
1752	\end_layout
1753
1754	\end_inset
1755	</cell>
1756	<cell alignment="center" valignment="top" topline="true" leftline="true" rightline="true" usebox="none">
1757	\begin_inset Text
1758
1759	\begin_layout Plain Layout
1760	1.7
1761	\end_layout
1762
1763	\end_inset
1764	</cell>
1765	</row>
1766	<row>
1767	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1768	\begin_inset Text
1769
1770	\begin_layout Plain Layout
1771	8
1772	\end_layout
1773
1774	\end_inset
1775	</cell>
1776	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1777	\begin_inset Text
1778
1779	\begin_layout Plain Layout
1780	\begin_inset Formula $0.0157$
1781	\end_inset
1782
1783
1784	\end_layout
1785
1786	\end_inset
1787	</cell>
1788	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1789	\begin_inset Text
1790
1791	\begin_layout Plain Layout
1792	\begin_inset Formula $0.00462$
1793	\end_inset
1794
1795
1796	\end_layout
1797
1798	\end_inset
1799	</cell>
1800	<cell alignment="center" valignment="top" topline="true" leftline="true" rightline="true" usebox="none">
1801	\begin_inset Text
1802
1803	\begin_layout Plain Layout
1804	1.5
1805	\end_layout
1806
1807	\end_inset
1808	</cell>
1809	</row>
1810	<row>
1811	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1812	\begin_inset Text
1813
1814	\begin_layout Plain Layout
1815	9
1816	\end_layout
1817
1818	\end_inset
1819	</cell>
1820	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1821	\begin_inset Text
1822
1823	\begin_layout Plain Layout
1824	\begin_inset Formula $0.00127$
1825	\end_inset
1826
1827
1828	\end_layout
1829
1830	\end_inset
1831	</cell>
1832	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1833	\begin_inset Text
1834
1835	\begin_layout Plain Layout
1836	\begin_inset Formula $0.0203$
1837	\end_inset
1838
1839
1840	\end_layout
1841
1842	\end_inset
1843	</cell>
1844	<cell alignment="center" valignment="top" topline="true" leftline="true" rightline="true" usebox="none">
1845	\begin_inset Text
1846
1847	\begin_layout Plain Layout
1848	1.3
1849	\end_layout
1850
1851	\end_inset
1852	</cell>
1853	</row>
1854	<row>
1855	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1856	\begin_inset Text
1857
1858	\begin_layout Plain Layout
1859	10
1860	\end_layout
1861
1862	\end_inset
1863	</cell>
1864	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1865	\begin_inset Text
1866
1867	\begin_layout Plain Layout
1868	\begin_inset Formula $0.0230$
1869	\end_inset
1870
1871
1872	\end_layout
1873
1874	\end_inset
1875	</cell>
1876	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1877	\begin_inset Text
1878
1879	\begin_layout Plain Layout
1880	\begin_inset Formula $0.0216$
1881	\end_inset
1882
1883
1884	\end_layout
1885
1886	\end_inset
1887	</cell>
1888	<cell alignment="center" valignment="top" topline="true" leftline="true" rightline="true" usebox="none">
1889	\begin_inset Text
1890
1891	\begin_layout Plain Layout
1892	1.2
1893	\end_layout
1894
1895	\end_inset
1896	</cell>
1897	</row>
1898	<row>
1899	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1900	\begin_inset Text
1901
1902	\begin_layout Plain Layout
1903	11
1904	\end_layout
1905
1906	\end_inset
1907	</cell>
1908	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1909	\begin_inset Text
1910
1911	\begin_layout Plain Layout
1912	\begin_inset Formula $0.208$
1913	\end_inset
1914
1915
1916	\end_layout
1917
1918	\end_inset
1919	</cell>
1920	<cell alignment="center" valignment="top" topline="true" leftline="true" usebox="none">
1921	\begin_inset Text
1922
1923	\begin_layout Plain Layout
1924	\begin_inset Formula $0.0446$
1925	\end_inset
1926
1927
1928	\end_layout
1929
1930	\end_inset
1931	</cell>
1932	<cell alignment="center" valignment="top" topline="true" leftline="true" rightline="true" usebox="none">
1933	\begin_inset Text
1934
1935	\begin_layout Plain Layout
1936	1.1
1937	\end_layout
1938
1939	\end_inset
1940	</cell>
1941	</row>
1942	<row>
1943	<cell alignment="center" valignment="top" topline="true" bottomline="true" leftline="true" usebox="none">
1944	\begin_inset Text
1945
1946	\begin_layout Plain Layout
1947	12
1948	\end_layout
1949
1950	\end_inset
1951	</cell>
1952	<cell alignment="center" valignment="top" topline="true" bottomline="true" leftline="true" usebox="none">
1953	\begin_inset Text
1954
1955	\begin_layout Plain Layout
1956	\begin_inset Formula $0.747$
1957	\end_inset
1958
1959
1960	\end_layout
1961
1962	\end_inset
1963	</cell>
1964	<cell alignment="center" valignment="top" topline="true" bottomline="true" leftline="true" usebox="none">
1965	\begin_inset Text
1966
1967	\begin_layout Plain Layout
1968	\begin_inset Formula $0.253$
1969	\end_inset
1970
1971
1972	\end_layout
1973
1974	\end_inset
1975	</cell>
1976	<cell alignment="center" valignment="top" topline="true" bottomline="true" leftline="true" rightline="true" usebox="none">
1977	\begin_inset Text
1978
1979	\begin_layout Plain Layout
1980	1
1981	\end_layout
1982
1983	\end_inset
1984	</cell>
1985	</row>
1986	</lyxtabular>
1987
1988	\end_inset
1989
1990
1991	\end_layout
1992
1993	\begin_layout Plain Layout
1994	\begin_inset Caption
1995
1996	\begin_layout Plain Layout
1997	\align left
1998	\begin_inset CommandInset label
1999	LatexCommand label
2000	name "tab:Example-PMF"
2001
2002	\end_inset
2003
2004	Example PMF
2005	\end_layout
2006
2007	\end_inset
2008
2009
2010	\end_layout
2011
2012	\begin_layout Plain Layout
2013
2014	\end_layout
2015
2016	\end_inset
2017
2018
2019	\end_layout
2020
2021	\begin_layout Standard
2022	The table demonstrates the importance of the selection of
2023	\begin_inset Formula $k$
2024	\end_inset
2025
2026	, and the tradeoff against file size expansion.
2027	Note that the survival of exactly 9 servers is significantly less likely
2028	than the survival of 8 or 10 servers.
2029	This is, again, an artifact of the group failure modes.
2030	Because of this, there is no reason to choose
2031	\begin_inset Formula $k=9$
2032	\end_inset
2033
2034	over
2035	\begin_inset Formula $k=10$
2036	\end_inset
2037
2038	.
2039	Normally, reducing the number of shares needed for reassembly improve the
2040	file's chances of survival, but in this case it provides a minuscule gain
2041	in reliability at the cost of a 10% increase in bandwidth and storage consumed.
2042	\end_layout
2043
2044	\begin_layout Subsection
2045	Share Duplication
2046	\end_layout
2047
2048	\begin_layout Standard
2049	Before moving on to consider issues other than single-interval file loss,
2050	let's analyze one more possibility, that of
2051	\begin_inset Quotes eld
2052	\end_inset
2053
2054	cheap
2055	\begin_inset Quotes erd
2056	\end_inset
2057
2058	file repair via share duplication.
2059	\end_layout
2060
2061	\begin_layout Standard
2062	Initially, files are split using erasure coding, which creates
2063	\begin_inset Formula $N$
2064	\end_inset
2065
2066	unique shares, any
2067	\begin_inset Formula $k$
2068	\end_inset
2069
2070	of which can be used to to reconstruct the file.
2071	When shares are lost, proper repair downloads some
2072	\begin_inset Formula $k$
2073	\end_inset
2074
2075	shares, reconstructs the original file and then uses the erasure coding
2076	algorithm to reconstruct the lost shares, then redeploys them to peers
2077	in the network.
2078	This is a somewhat expensive process.
2079	\end_layout
2080
2081	\begin_layout Standard
2082	A cheaper repair option is simply to direct some peer that has share
2083	\begin_inset Formula $s_{i}$
2084	\end_inset
2085
2086	to send a copy to another peer, thus increasing by one the number of shares
2087	in the network.
2088	This is not as good as actually replacing the lost share, though.
2089	Suppose that more shares were lost, leaving only
2090	\begin_inset Formula $k$
2091	\end_inset
2092
2093	shares remaining.
2094	If two of those shares are identical, because one was duplicated in this
2095	fashion, then only
2096	\begin_inset Formula $k-1$
2097	\end_inset
2098
2099	shares truly remain, and the file can no longer be reconstructed.
2100	\end_layout
2101
2102	\begin_layout Standard
2103	However, such cheap repair is not completely pointless; it does increase
2104	file survivability.
2105	But by how much?
2106	\end_layout
2107
2108	\begin_layout Standard
2109	Effectively, share duplication simply increases the probability that
2110	\begin_inset Formula $s_{i}$
2111	\end_inset
2112
2113	will survive, by providing two locations from which to retrieve it.
2114	We can view the two copies of the single share as one, but with a higher
2115	probability of survival than would be provided by either of the two peers.
2116	In particular, if
2117	\begin_inset Formula $p_{1}$
2118	\end_inset
2119
2120	and
2121	\begin_inset Formula $p_{2}$
2122	\end_inset
2123
2124	are the probabilities that the two peers will survive, respectively, then
2125	\begin_inset Formula \[
2126	Pr[s_{i}\, survives]=p_{1}+p_{2}-p_{1}p_{2}\]
2127
2128	\end_inset
2129
2130
2131	\end_layout
2132
2133	\begin_layout Standard
2134	More generally, if a single share is deployed on
2135	\begin_inset Formula $n$
2136	\end_inset
2137
2138	peers, each with a PMF
2139	\begin_inset Formula $f_{i}(j),0\leq j\leq1,1\leq i\leq n$
2140	\end_inset
2141
2142	, the share survival count is a random variable
2143	\begin_inset Formula $K$
2144	\end_inset
2145
2146	and the probability of share loss is
2147	\begin_inset Formula \[
2148	Pr[K=0]=(f_{1}\star f_{2}\star\ldots\star f_{n})(0)\]
2149
2150	\end_inset
2151
2152
2153	\end_layout
2154
2155	\begin_layout Standard
2156	From that, we can construct a share PMF in the obvious way, which can then
2157	be convolved with the other share PMFs to produce the share set PMF.
2158	\end_layout
2159
2160	\begin_layout Example
2161	Suppose a file has
2162	\begin_inset Formula $N=10,k=3$
2163	\end_inset
2164
2165	and that all servers have survival probability
2166	\begin_inset Formula $p=.9$
2167	\end_inset
2168
2169	.
2170	Given a full complement of shares,
2171	\begin_inset Formula $Pr[\textrm{file\, loss}]=3.74\times10^{-7}$
2172	\end_inset
2173
2174	.
2175	Suppose that four shares are lost, which increases
2176	\begin_inset Formula $Pr[\textrm{file\, loss}]$
2177	\end_inset
2178
2179	to
2180	\begin_inset Formula $.00127$
2181	\end_inset
2182
2183	, a value
2184	\begin_inset Formula $3400$
2185	\end_inset
2186
2187	times greater.
2188	Rather than doing a proper reconstruction, we could direct four peers still
2189	holding shares to send a copy of their share to new peer, which changes
2190	the composition of the shares from one of six, unique
2191	\begin_inset Quotes eld
2192	\end_inset
2193
2194	standard
2195	\begin_inset Quotes erd
2196	\end_inset
2197
2198	shares, to one of two standard shares, each with survival probability
2199	\begin_inset Formula $.9$
2200	\end_inset
2201
2202	and four
2203	\begin_inset Quotes eld
2204	\end_inset
2205
2206	doubled
2207	\begin_inset Quotes erd
2208	\end_inset
2209
2210	shares, each with survival probability
2211	\begin_inset Formula $2p-p^{2}\approxeq.99$
2212	\end_inset
2213
2214	.
2215	\end_layout
2216
2217	\begin_layout Example
2218	Combining the two single-peer share PMFs with the four double-share PMFs
2219	gives a new file survival probability of
2220	\begin_inset Formula $6.64\times10^{-6}$
2221	\end_inset
2222
2223	.
2224	Not as good as a full repair, but still quite respectable.
2225	Also, if storage were not a concern, all six shares could be duplicated,
2226	for a
2227	\begin_inset Formula $Pr[file\, loss]=1.48\times10^{-7}$
2228	\end_inset
2229
2230	, which is actually three time better than the nominal case.
2231	\end_layout
2232
2233	\begin_layout Example
2234	The reason such cheap repairs may be attractive in many cases is that distribute
2235	d bandwidth is cheaper than bandwidth through a single peer.
2236	This is particularly true if that single peer has a very slow connection,
2237	which is common for home computers -- especially in the outbound direction.
2238	\end_layout
2239
2240	\begin_layout Section
2241	Long-Term Reliability
2242	\end_layout
2243
2244	\begin_layout Standard
2245	Thus far, we've focused entirely on the probability that a file survives
2246	the interval
2247	\begin_inset Formula $A$
2248	\end_inset
2249
2250	between repair times.
2251	The probability that a file survives long-term, though, is also important.
2252	As long as the probability of failure during a repair period is non-zero,
2253	a given file will eventually be lost.
2254	We want to know the probability of surviving for time
2255	\begin_inset Formula $T$
2256	\end_inset
2257
2258	, and how the parameters
2259	\begin_inset Formula $A$
2260	\end_inset
2261
2262	(time between repairs) and
2263	\begin_inset Formula $L$
2264	\end_inset
2265
2266	(allowed share low watermark) affect survival time.
2267	\end_layout
2268
2269	\begin_layout Standard
2270	To model file survival time, let
2271	\begin_inset Formula $T$
2272	\end_inset
2273
2274	be a random variable denoting the time at which a given file becomes unrecovera
2275	ble, and
2276	\begin_inset Formula $R(t)=Pr[T>t]$
2277	\end_inset
2278
2279	be a function that gives the probability that the file survives to time
2280
2281	\begin_inset Formula $t$
2282	\end_inset
2283
2284	.
2285
2286	\begin_inset Formula $R(t)$
2287	\end_inset
2288
2289	is the cumulative distribution function of
2290	\begin_inset Formula $T$
2291	\end_inset
2292
2293	.
2294	\end_layout
2295
2296	\begin_layout Standard
2297	Most survival functions are continuous, but
2298	\begin_inset Formula $R(t)$
2299	\end_inset
2300
2301	is inherently discrete and stochastic.
2302	The time steps are the repair intervals, each of length
2303	\begin_inset Formula $A$
2304	\end_inset
2305
2306	, so
2307	\begin_inset Formula $T$
2308	\end_inset
2309
2310	-values are multiples of
2311	\begin_inset Formula $A$
2312	\end_inset
2313
2314	.
2315	During each interval, the file's shares degrade according to the probability
2316	mass function of
2317	\begin_inset Formula $K$
2318	\end_inset
2319
2320	.
2321	\end_layout
2322
2323	\begin_layout Subsection
2324	Aggressive Repair
2325	\end_layout
2326
2327	\begin_layout Standard
2328	Let's first consider the case of an aggressive repairer.
2329	Every interval, this repairer checks the file for share losses and restores
2330	them.
2331	Thus, at the beginning of each interval, the file always has
2332	\begin_inset Formula $N$
2333	\end_inset
2334
2335	shares, distributed on servers with various individual and group failure
2336	probabilities, which will survive or fail per the output of random variable
2337
2338	\begin_inset Formula $K$
2339	\end_inset
2340
2341	.
2342	\end_layout
2343
2344	\begin_layout Standard
2345	For any interval, then, the probability that the file will survive is
2346	\begin_inset Formula $f\left(k\right)=Pr[K\geq k]$
2347	\end_inset
2348
2349	.
2350	Since each interval success or failure is independent, and assuming the
2351	share reliabilities remain constant over time,
2352	\begin_inset Formula \begin{equation}
2353	R\left(t\right)=f(k)^{t}\end{equation}
2354
2355	\end_inset
2356
2357
2358	\end_layout
2359
2360	\begin_layout Standard
2361	This simple survival function makes it simple to select parameters
2362	\begin_inset Formula $N$
2363	\end_inset
2364
2365	and
2366	\begin_inset Formula $K$
2367	\end_inset
2368
2369	such that
2370	\begin_inset Formula $R(t)\geq r$
2371	\end_inset
2372
2373	, where
2374	\begin_inset Formula $r$
2375	\end_inset
2376
2377	is a user-specified parameter indicating the desired probability of survival
2378	to time
2379	\begin_inset Formula $t$
2380	\end_inset
2381
2382	.
2383	Specifically, we can solve for
2384	\begin_inset Formula $f\left(k\right)$
2385	\end_inset
2386
2387	in
2388	\begin_inset Formula $r\leq f\left(k\right)^{t}$
2389	\end_inset
2390
2391	, giving:
2392	\begin_inset Formula \begin{equation}
2393	f\left(k\right)\geq\sqrt[t]{r}\end{equation}
2394
2395	\end_inset
2396
2397
2398	\end_layout
2399
2400	\begin_layout Standard
2401	So, given a PMF
2402	\begin_inset Formula $f\left(k\right)$
2403	\end_inset
2404
2405	, to assure the survival of a file to time
2406	\begin_inset Formula $t$
2407	\end_inset
2408
2409	with probability at least
2410	\begin_inset Formula $r$
2411	\end_inset
2412
2413	, choose
2414	\begin_inset Formula $k$
2415	\end_inset
2416
2417	such that
2418	\begin_inset Formula $f\left(k\right)\geq\sqrt[t]{r}$
2419	\end_inset
2420
2421	.
2422	For example, if
2423	\begin_inset Formula $A$
2424	\end_inset
2425
2426	is one month, and
2427	\begin_inset Formula $r=1-\nicefrac{1}{10^{6}}$
2428	\end_inset
2429
2430	and
2431	\begin_inset Formula $t=120$
2432	\end_inset
2433
2434	, or 10 years, we calculate
2435	\begin_inset Formula $f\left(k\right)\geq\sqrt[120]{.999999}\approx0.999999992$
2436	\end_inset
2437
2438	.
2439	Per the PMF of table
2440	\begin_inset CommandInset ref
2441	LatexCommand ref
2442	reference "tab:Example-PMF"
2443
2444	\end_inset
2445
2446	, this means
2447	\begin_inset Formula $k=2$
2448	\end_inset
2449
2450	, achieves the goal, at the cost of a six-fold expansion in stored file
2451	size.
2452	If the lesser goal of no more than
2453	\begin_inset Formula $\nicefrac{1}{1000}$
2454	\end_inset
2455
2456	probability of loss is taken, then since
2457	\begin_inset Formula $\sqrt[120]{.9999}=.999992$
2458	\end_inset
2459
2460	,
2461	\begin_inset Formula $k=5$
2462	\end_inset
2463
2464	achieves the goal with an expansion factor of
2465	\begin_inset Formula $2.4$
2466	\end_inset
2467
2468	.
2469	\end_layout
2470
2471	\begin_layout Subsection
2472	Repair Cost
2473	\end_layout
2474
2475	\begin_layout Standard
2476	The simplicity and predictability of aggressive repair is attractive, but
2477	there is a downside: Repairs cost processing power and bandwidth.
2478	The processing power is proportional to the size of the file, since the
2479	whole file must be reconstructed and then re-processed using the Reed-Solomon
2480	algorithm, while the bandwidth cost is proportional to the number of missing
2481	shares that must be replaced,
2482	\begin_inset Formula $N-K$
2483	\end_inset
2484
2485	.
2486	\end_layout
2487
2488	\begin_layout Standard
2489	Let
2490	\begin_inset Formula $c\left(s,d,k\right)$
2491	\end_inset
2492
2493	be a cost function that combines the processing cost of regenerating a
2494	file of size
2495	\begin_inset Formula $s$
2496	\end_inset
2497
2498	and the bandwidth cost of downloading a file of size
2499	\begin_inset Formula $s$
2500	\end_inset
2501
2502	and uploading
2503	\begin_inset Formula $d$
2504	\end_inset
2505
2506	shares each of size
2507	\begin_inset Formula $\nicefrac{s}{k}$
2508	\end_inset
2509
2510	.
2511	Also, let
2512	\begin_inset Formula $D$
2513	\end_inset
2514
2515	denote the random variable
2516	\begin_inset Formula $N-K$
2517	\end_inset
2518
2519	, which is the number of shares that must be redistributed to bring the
2520	file share set back up to
2521	\begin_inset Formula $N$
2522	\end_inset
2523
2524	after degrading during an interval.
2525	The probability mass function of
2526	\begin_inset Formula $D$
2527	\end_inset
2528
2529	is
2530	\begin_inset Formula \[
2531	Pr[D=d]=f(d)=\begin{cases}
2532	Pr\left[K=N\right]+Pr[K<k] & d=0\\
2533	Pr\left[K=N-d\right] & 0<d\leq N-k\\
2534	0 & N-k<d\leq N\end{cases}\]
2535
2536	\end_inset
2537
2538
2539	\end_layout
2540
2541	\begin_layout Standard
2542	The expected cost of repairs in a given interval, then, is simply
2543	\begin_inset Formula $c\left(s,E\left[D\right],k\right)$
2544	\end_inset
2545
2546	where E is the expected value function -- in this case:
2547	\begin_inset Formula \begin{align*}
2548	E[D] & =\sum_{d=0}^{N}d\cdot Pr\left[D=d\right]\\
2549	& =0\cdot Pr\left[D=0\right]+\sum_{d=1}^{N-k}\left\{ d\cdot Pr\left[K=N-d\right]\right\} +\sum_{d=N-k+1}^{N}\left\{ d\cdot0\right\} \\
2550	& =\sum_{d=1}^{N-k}d\cdot Pr\left[K=N-d\right]\end{align*}
2551
2552	\end_inset
2553
2554
2555	\end_layout
2556
2557	\begin_layout Standard
2558	Since each interval starts with a full complement of shares, the expected
2559	repair cost for each interval is the same, and the cost for file that survives
2560	for
2561	\begin_inset Formula $t$
2562	\end_inset
2563
2564	intervals is
2565	\begin_inset Formula $t\cdot c\left(s,E\left[D\right]\right)$
2566	\end_inset
2567
2568	.
2569	To calculate the lifetime repair cost, we just take the limit over all
2570	intervals as
2571	\begin_inset Formula $t\rightarrow\infty$
2572	\end_inset
2573
2574	, discounting each cost by the probability that the file has already failed.
2575	So, the lifetime expected repair cost is
2576	\begin_inset Formula \begin{align*}
2577	\sum_{t=1}^{\infty}R\left(t-1\right)c\left(s,E\left[D\right],k\right) & =c\left(s,E\left[D\right],k\right)\sum_{t=1}^{\infty}R\left(t-1\right)\\
2578	& =c\left(s,E\left[D\right],k\right)\sum_{t=1}^{\infty}f\left(k\right)^{t-1}\\
2579	& =c\left(s,E\left[D\right],k\right)\cdot\frac{1}{1-f\left(k\right)}\\
2580	& =\frac{c\left(s,E\left[D\right],k\right)}{1-f\left(k\right)}\end{align*}
2581
2582	\end_inset
2583
2584
2585	\end_layout
2586
2587	\begin_layout Standard
2588	It is also necessary to discount future cost, since CPU and bandwidth are
2589	both going to get cheaper over time.
2590	To accommodate this, we throw in an addition per-period discount rate
2591	\begin_inset Formula $r$
2592	\end_inset
2593
2594	.
2595	In accordance with common discount rate usage, the discount multiplier
2596	at time
2597	\begin_inset Formula $t$
2598	\end_inset
2599
2600	is
2601	\begin_inset Formula $\left(1-r\right)^{t}$
2602	\end_inset
2603
2604	.
2605	This gives:
2606	\begin_inset Formula \begin{align*}
2607	\sum_{t=1}^{\infty}\left(1-r\right){}^{t}R\left(t-1\right)c\left(s,E\left[D\right],k\right) & =c\left(s,E\left[D\right],k\right)\sum_{t=1}^{\infty}\left(1-r\right)^{t}f\left(k\right)^{t-1}\\
2608	& =c\left(s,E\left[D\right],k\right)\sum_{t=1}^{\infty}\left(1-r\right)^{t}f\left(k\right)^{t-1}\\
2609	& =c\left(s,E\left[D\right],k\right)\left(1-r\right)\sum_{t=1}^{\infty}\left(1-r\right)^{t-1}f\left(k\right)^{t-1}\\
2610	& =\frac{c\left(s,E\left[D\right],k\right)\left(1-r\right)}{1-\left(1-r\right)f\left(k\right)}\end{align*}
2611
2612	\end_inset
2613
2614	If
2615	\begin_inset Formula $r=0$
2616	\end_inset
2617
2618	this collapses to the previous result, as one would expect.
2619	\end_layout
2620
2621	\begin_layout Subsection
2622	Non-aggressive Repair
2623	\end_layout
2624
2625	\begin_layout Standard
2626	Need to write this.
2627	\end_layout
2628
2629	\begin_layout Section
2630	Time-Sensitive Retrieval
2631	\end_layout
2632
2633	\begin_layout Standard
2634	The above work has almost entirely ignored the distinction between availability
2635	and reliability.
2636	\end_layout
2637
2638	\begin_layout Standard
2639	Need to write this.
2640	\end_layout
2641
2642	\end_body
2643	\end_document

Note: See TracBrowser for help on using the repository browser.

Context Navigation

source: trunk/docs/proposed/lossmodel.lyx

Download in other formats: