Ticket #1795: ucw_text_transcript.log

File ucw_text_transcript.log, 14.5 KB (added by jean, at 2012-07-25T03:16:40Z)

Transcript of the incident report

Line 
1local#6637 16:21:45.808: SharemapUpdater(q5x5y): starting (MODE_CHECK)
2local#6651 16:21:45.903: got result from [czhyfn], 1 shares
3local#6654 16:21:45.906:  found valid version 1-nrye from czhyfn-sh3: 3-10/33/31
4local#6659 16:21:45.910: got result from [vl4mhy], 1 shares
5local#6661 16:21:45.916: got result from [otatih], 1 shares
6local#6663 16:21:45.920: got result from [dpertc], 0 shares
7local#6669 16:21:45.923: got result from [pwy4gw], 0 shares
8local#6675 16:21:45.927: got result from [lfajxc], 0 shares
9local#6681 16:21:45.932: got result from [lomvn5], 1 shares
10local#6683 16:21:45.935: got result from [t66p3x], 1 shares
11local#6685 16:21:45.941: got result from [7ytyt2], 1 shares
12local#6687 16:21:45.946: got result from [k3zpuu], 1 shares
13local#6689 16:21:45.951: got result from [z5rven], 1 shares
14local#6691 16:21:45.956: got result from [lyxj6a], 1 shares
15local#6693 16:21:45.962: got result from [bt2zsj], 1 shares
16local#6696 16:21:45.964:  found valid version 1-nrye from vl4mhy-sh7: 3-10/33/31
17local#6702 16:21:45.965:  found valid version 1-nrye from otatih-sh8: 3-10/33/31
18local#6708 16:21:45.966:  found valid version 1-nrye from lomvn5-sh5: 3-10/33/31
19local#6714 16:21:45.967:  found valid version 1-nrye from t66p3x-sh6: 3-10/33/31
20local#6720 16:21:45.968:  found valid version 1-nrye from 7ytyt2-sh1: 3-10/33/31
21local#6726 16:21:45.969:  found valid version 1-nrye from k3zpuu-sh9: 3-10/33/31
22local#6732 16:21:45.970:  found valid version 1-nrye from z5rven-sh0: 3-10/33/31
23local#6738 16:21:45.971:  found valid version 1-nrye from lyxj6a-sh2: 3-10/33/31
24local#6744 16:21:45.972:  found valid version 1-nrye from bt2zsj-sh4: 3-10/33/31
25local#6746 16:21:45.973: all queries are retired, no extra servers: done
26local#6747 16:21:45.973: servermap: 10*seq1-nrye
27local#6750 16:21:45.974: SharemapUpdater(q5x5y): starting (MODE_WRITE)
28local#6764 16:21:46.065: got result from [pwy4gw], 0 shares
29local#6774 16:21:46.073: got result from [czhyfn], 1 shares
30local#6775 16:21:46.074: got valid privkey from shnum 3 on serverid czhyfn
31local#6777 16:21:46.079: got result from [vl4mhy], 1 shares
32local#6779 16:21:46.083: got result from [dpertc], 0 shares
33local#6789 16:21:46.087: got result from [lfajxc], 0 shares
34local#6790 16:21:46.087: _check_for_done, mode is 'MODE_WRITE', 10 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
35local#6791 16:21:46.088: no recoverable versions: need more
36local#6792 16:21:46.088:  there are 10 queries outstanding
37local#6793 16:21:46.088: sending 0 more queries:
38local#6794 16:21:46.088: _got_results done
39local#6795 16:21:46.088: _check_for_done, mode is 'MODE_WRITE', 10 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
40local#6796 16:21:46.088: no recoverable versions: need more
41local#6797 16:21:46.088:  there are 10 queries outstanding
42local#6798 16:21:46.088: sending 0 more queries:
43local#6799 16:21:46.093: got result from [lomvn5], 1 shares
44local#6800 16:21:46.094: _got_results done
45local#6801 16:21:46.096: got result from [t66p3x], 1 shares
46local#6802 16:21:46.097: _got_results done
47local#6803 16:21:46.102: got result from [7ytyt2], 1 shares
48local#6804 16:21:46.103: _got_results done
49local#6805 16:21:46.107: got result from [k3zpuu], 1 shares
50local#6806 16:21:46.109: _got_results done
51local#6807 16:21:46.112: got result from [z5rven], 1 shares
52local#6808 16:21:46.114: _got_results done
53local#6809 16:21:46.118: got result from [lyxj6a], 1 shares
54local#6810 16:21:46.119: _got_results done
55local#6811 16:21:46.119: _got_results: got shnum #3 from serverid czhyfn
56local#6812 16:21:46.120:  found valid version 1-nrye from czhyfn-sh3: 3-10/33/31
57local#6813 16:21:46.120: _check_for_done, mode is 'MODE_WRITE', 9 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
58local#6814 16:21:46.120: no recoverable versions: need more
59local#6815 16:21:46.120:  there are 9 queries outstanding
60local#6816 16:21:46.120: sending 0 more queries:
61local#6817 16:21:46.121: _check_for_done, mode is 'MODE_WRITE', 9 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
62local#6818 16:21:46.121: no recoverable versions: need more
63local#6819 16:21:46.121:  there are 9 queries outstanding
64local#6820 16:21:46.121: sending 0 more queries:
65local#6821 16:21:46.121: _got_results: got shnum #7 from serverid vl4mhy
66local#6822 16:21:46.121:  found valid version 1-nrye from vl4mhy-sh7: 3-10/33/31
67local#6823 16:21:46.122: _check_for_done, mode is 'MODE_WRITE', 8 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
68local#6824 16:21:46.122: no recoverable versions: need more
69local#6825 16:21:46.122:  there are 8 queries outstanding
70local#6826 16:21:46.122: sending 0 more queries:
71local#6827 16:21:46.122: _check_for_done, mode is 'MODE_WRITE', 8 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
72local#6828 16:21:46.122: no recoverable versions: need more
73local#6829 16:21:46.122:  there are 8 queries outstanding
74local#6830 16:21:46.122: sending 0 more queries:
75local#6831 16:21:46.122: _got_results: got shnum #5 from serverid lomvn5
76local#6832 16:21:46.123:  found valid version 1-nrye from lomvn5-sh5: 3-10/33/31
77local#6833 16:21:46.123: _check_for_done, mode is 'MODE_WRITE', 7 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
78local#6834 16:21:46.124: found our boundary, 1111?111?1000
79local#6835 16:21:46.124:  there are 7 queries outstanding
80local#6836 16:21:46.124: sending 0 more queries:
81local#6837 16:21:46.124: _check_for_done, mode is 'MODE_WRITE', 7 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
82local#6838 16:21:46.124: found our boundary, 1111?111?1000
83local#6839 16:21:46.124:  there are 7 queries outstanding
84local#6840 16:21:46.124: sending 0 more queries:
85local#6841 16:21:46.125: _got_results: got shnum #6 from serverid t66p3x
86local#6842 16:21:46.125:  found valid version 1-nrye from t66p3x-sh6: 3-10/33/31
87local#6843 16:21:46.125: _check_for_done, mode is 'MODE_WRITE', 6 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
88local#6844 16:21:46.126: found our boundary, 1111?111?1000
89local#6845 16:21:46.126:  there are 6 queries outstanding
90local#6846 16:21:46.126: sending 0 more queries:
91local#6847 16:21:46.126: _check_for_done, mode is 'MODE_WRITE', 6 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
92local#6848 16:21:46.127: found our boundary, 1111?111?1000
93local#6849 16:21:46.127:  there are 6 queries outstanding
94local#6850 16:21:46.127: sending 0 more queries:
95local#6851 16:21:46.127: _got_results: got shnum #1 from serverid 7ytyt2
96local#6852 16:21:46.127:  found valid version 1-nrye from 7ytyt2-sh1: 3-10/33/31
97local#6853 16:21:46.127: _check_for_done, mode is 'MODE_WRITE', 5 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
98local#6854 16:21:46.128: found our boundary, 1111?111?1000
99local#6855 16:21:46.128:  there are 5 queries outstanding
100local#6856 16:21:46.128: sending 0 more queries:
101local#6857 16:21:46.128: _check_for_done, mode is 'MODE_WRITE', 5 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
102local#6858 16:21:46.129: found our boundary, 1111?111?1000
103local#6859 16:21:46.129:  there are 5 queries outstanding
104local#6860 16:21:46.129: sending 0 more queries:
105local#6861 16:21:46.129: _got_results: got shnum #9 from serverid k3zpuu
106local#6862 16:21:46.129:  found valid version 1-nrye from k3zpuu-sh9: 3-10/33/31
107local#6863 16:21:46.130: _check_for_done, mode is 'MODE_WRITE', 4 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
108local#6864 16:21:46.130: found our boundary, 1111?111?1000
109local#6865 16:21:46.130:  there are 4 queries outstanding
110local#6866 16:21:46.130: sending 0 more queries:
111local#6867 16:21:46.131: _check_for_done, mode is 'MODE_WRITE', 4 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
112local#6868 16:21:46.131: found our boundary, 1111?111?1000
113local#6869 16:21:46.131:  there are 4 queries outstanding
114local#6870 16:21:46.131: sending 0 more queries:
115local#6871 16:21:46.131: _got_results: got shnum #0 from serverid z5rven
116local#6872 16:21:46.132:  found valid version 1-nrye from z5rven-sh0: 3-10/33/31
117local#6873 16:21:46.132: _check_for_done, mode is 'MODE_WRITE', 3 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
118local#6874 16:21:46.133: found our boundary, 1111?111?1000
119local#6875 16:21:46.133:  there are 3 queries outstanding
120local#6876 16:21:46.133: sending 0 more queries:
121local#6877 16:21:46.133: _check_for_done, mode is 'MODE_WRITE', 3 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
122local#6878 16:21:46.134: found our boundary, 1111?111?1000
123local#6879 16:21:46.134:  there are 3 queries outstanding
124local#6880 16:21:46.134: sending 0 more queries:
125local#6881 16:21:46.134: _got_results: got shnum #2 from serverid lyxj6a
126local#6882 16:21:46.134:  found valid version 1-nrye from lyxj6a-sh2: 3-10/33/31
127local#6883 16:21:46.134: _check_for_done, mode is 'MODE_WRITE', 2 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
128local#6884 16:21:46.135: found our boundary, 1111?111?1000
129local#6885 16:21:46.135:  there are 2 queries outstanding
130local#6886 16:21:46.135: sending 0 more queries:
131local#6887 16:21:46.135: _check_for_done, mode is 'MODE_WRITE', 2 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
132local#6888 16:21:46.136: found our boundary, 1111?111?1000
133local#6889 16:21:46.136:  there are 2 queries outstanding
134local#6890 16:21:46.136: sending 0 more queries:
135local#6891 16:21:46.141: got result from [otatih], 1 shares
136local#6892 16:21:46.142: _got_results done
137local#6893 16:21:46.147: got result from [bt2zsj], 1 shares
138local#6894 16:21:46.148: _got_results done
139local#6895 16:21:46.149: _got_results: got shnum #8 from serverid otatih
140local#6896 16:21:46.149:  found valid version 1-nrye from otatih-sh8: 3-10/33/31
141local#6897 16:21:46.149: _check_for_done, mode is 'MODE_WRITE', 1 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
142local#6898 16:21:46.150: found our boundary, 1111111111000
143local#6899 16:21:46.150: have all our answers
144local#6900 16:21:46.151: servermap: 9*seq1-nrye
145local#6901 16:21:46.151: _check_for_done, mode is 'MODE_WRITE', 1 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
146local#6902 16:21:46.151: but we're not running
147local#6903 16:21:46.151: _got_results: got shnum #4 from serverid bt2zsj
148local#6904 16:21:46.151: but we're not running anymore.
149local#6905 16:21:46.151: _check_for_done, mode is 'MODE_WRITE', 0 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
150local#6906 16:21:46.151: but we're not running
151local#6907 16:21:46.151: _check_for_done, mode is 'MODE_WRITE', 0 queries outstanding, 0 extra servers available, 0 'must query' servers left, need_privkey=False
152local#6908 16:21:46.152: but we're not running
153local#6909 16:21:46.152: Publish(q5x5y): starting
154local#6910 16:21:46.152: starting publish, datalen is 1199
155local#6911 16:21:46.153: new seqnum will be 2
156local#6912 16:21:46.153: building encoding parameters for file
157local#6913 16:21:46.153: got segsize 1200
158local#6914 16:21:46.153: got 1 segments
159local#6915 16:21:46.153: got tail segment size 1199
160local#6916 16:21:46.153: got start segment 0
161local#6917 16:21:46.153: got end segment 0
162local#6918 16:21:46.154: current goal: before update: , sh0 to [z5rven], sh1 to [7ytyt2], sh2 to [lyxj6a], sh3 to [czhyfn], sh5 to [lomvn5], sh6 to [t66p3x], sh7 to [vl4mhy], sh8 to [otatih], sh9 to [k3zpuu]
163local#6919 16:21:46.154: we are planning to push new seqnum=#2
164local#6920 16:21:46.154: current goal: after update: , sh0 to [z5rven], sh1 to [7ytyt2], sh2 to [lyxj6a], sh3 to [czhyfn], sh4 to [bt2zsj], sh5 to [lomvn5], sh6 to [t66p3x], sh7 to [vl4mhy], sh8 to [otatih], sh9 to [k3zpuu]
165local#6921 16:21:46.154: we are planning to push new seqnum=#2
166local#6922 16:21:46.159: Starting push
167local#6923 16:21:46.160: Pushing segment 1 of 1
168local#6924 16:21:46.351: _got_write_answer from czhyfn, share 3
169local#6925 16:21:46.351: found the following surprise shares: set([])
170local#6926 16:21:46.351: wrote successfully: adding new share to servermap
171local#6927 16:21:46.361: _got_write_answer from vl4mhy, share 7
172local#6928 16:21:46.361: found the following surprise shares: set([])
173local#6929 16:21:46.361: wrote successfully: adding new share to servermap
174local#6930 16:21:46.372: _got_write_answer from lomvn5, share 5
175local#6931 16:21:46.373: found the following surprise shares: set([])
176local#6932 16:21:46.373: wrote successfully: adding new share to servermap
177local#6933 16:21:46.375: _got_write_answer from t66p3x, share 6
178local#6934 16:21:46.375: found the following surprise shares: set([])
179local#6935 16:21:46.375: wrote successfully: adding new share to servermap
180local#6936 16:21:46.383: _got_write_answer from 7ytyt2, share 1
181local#6937 16:21:46.383: found the following surprise shares: set([])
182local#6938 16:21:46.383: wrote successfully: adding new share to servermap
183local#6939 16:21:46.390: _got_write_answer from k3zpuu, share 9
184local#6940 16:21:46.391: found the following surprise shares: set([])
185local#6941 16:21:46.391: wrote successfully: adding new share to servermap
186local#6942 16:21:46.397: _got_write_answer from z5rven, share 0
187local#6943 16:21:46.397: found the following surprise shares: set([])
188local#6944 16:21:46.397: wrote successfully: adding new share to servermap
189local#6945 16:21:46.403: _got_write_answer from lyxj6a, share 2
190local#6946 16:21:46.403: found the following surprise shares: set([])
191local#6947 16:21:46.403: wrote successfully: adding new share to servermap
192local#6948 16:21:46.411: _got_write_answer from otatih, share 8
193local#6949 16:21:46.411: found the following surprise shares: set([])
194local#6950 16:21:46.411: wrote successfully: adding new share to servermap
195local#6951 16:21:46.418: _got_write_answer from bt2zsj, share 4
196local#6952 16:21:46.418: found the following surprise shares: set([])
197local#6953 16:21:46.419: our testv failed, so the write did not happen [INCIDENT-TRIGGER]
198local#6954 16:21:47.586: Publish failed with UncoordinatedWriteError
199local#6955 16:21:47.588: [Failure instance: Traceback (failure with no frames): <class 'allmydata.mutable.common.UncoordinatedWriteError'>:
200]