diff options
author | Karsten Loesing <karsten.loesing@gmx.net> | 2009-09-03 14:44:01 +0200 |
---|---|---|
committer | Mike Perry <mikeperry-git@fscked.org> | 2009-09-16 15:52:05 -0700 |
commit | b508e4748f42436ad1e9b05970cc4d3c5c1debfc (patch) | |
tree | 74e88dcf7587d23a15492672a0b781ce78246edc | |
parent | 535423a3bb87fcdafc4f25f8a7e3898b127ff77c (diff) | |
download | tor-b508e4748f42436ad1e9b05970cc4d3c5c1debfc.tar.gz tor-b508e4748f42436ad1e9b05970cc4d3c5c1debfc.zip |
Remove trailing spaces. As if bytes were free...
Also correct some typos.
-rw-r--r-- | doc/spec/proposals/151-path-selection-improvements.txt | 52 | ||||
-rw-r--r-- | src/or/circuitbuild.c | 9 |
2 files changed, 31 insertions, 30 deletions
diff --git a/doc/spec/proposals/151-path-selection-improvements.txt b/doc/spec/proposals/151-path-selection-improvements.txt index df19e0f71f..7821a5dddb 100644 --- a/doc/spec/proposals/151-path-selection-improvements.txt +++ b/doc/spec/proposals/151-path-selection-improvements.txt @@ -8,7 +8,7 @@ Overview The performance of paths selected can be improved by adjusting the CircuitBuildTimeout and avoiding failing guard nodes. This proposal - describes a method of tracking buildtime statistics at the client, and + describes a method of tracking buildtime statistics at the client, and using those statistics to adjust the CircuitBuildTimeout. Motivation @@ -30,15 +30,15 @@ Implementation too large, because it will make it difficult for clients to adapt to moving between different links. - From our observations, this value appears to be on the order of 1000, + From our observations, this value appears to be on the order of 1000, but is configurable in a #define NCIRCUITS_TO_OBSERVE. - + Long Term Storage - The long-term storage representation is implemented by storing a - histogram with BUILDTIME_BIN_WIDTH millisecond buckets (default 50) when + The long-term storage representation is implemented by storing a + histogram with BUILDTIME_BIN_WIDTH millisecond buckets (default 50) when writing out the statistics to disk. The format this takes in the - state file is 'CircuitBuildTime <bin-ms> <count>', with the total + state file is 'CircuitBuildTime <bin-ms> <count>', with the total specified as 'TotalBuildTimes <total>' Example: @@ -57,7 +57,7 @@ Implementation Learning the CircuitBuildTimeout Based on studies of build times, we found that the distribution of - circuit buildtimes appears to be a Pareto distribution. + circuit buildtimes appears to be a Pareto distribution. We will calculate the parameters for a Pareto distribution fitting the data using the estimators at @@ -68,7 +68,7 @@ Implementation BUILDTIME_PERCENT_CUTOFF (80%) of the mass of the distribution is below the timeout value. - Thus, we expect that the Tor client will accept the fastest 80% of + Thus, we expect that the Tor client will accept the fastest 80% of the total number of paths on the network. Detecting Changing Network Conditions @@ -76,65 +76,65 @@ Implementation We attempt to detect both network connectivty loss and drastic changes in the timeout characteristics. Network connectivity loss is detected by recording a timestamp every time Tor either completes - a TLS connection or receives a cell. If this timestamp is more than + a TLS connection or receives a cell. If this timestamp is more than 90 seconds in the past, circuit timeouts are no longer counted. - If more than MAX_RECENT_TIMEOUT_RATE (80%) of the past + If more than MAX_RECENT_TIMEOUT_RATE (80%) of the past RECENT_CIRCUITS (20) time out, we assume the network connection has changed, and we discard all buildtimes history and compute a new timeout by estimating a new Pareto curve using the position on the Pareto Quartile function for the ratio of - timeouts. + timeouts. Testing After circuit build times, storage, and learning are implemented, the resulting histogram should be checked for consistency by - verifying it persists across successive Tor invocations where + verifying it persists across successive Tor invocations where no circuits are built. In addition, we can also use the existing - buildtime scripts to record build times, and verify that the histogram + buildtime scripts to record build times, and verify that the histogram the python produces matches that which is output to the state file in Tor, and verify that the Pareto parameters and cutoff points also match. - + Soft timeout vs Hard Timeout - - At some point, it may be desirable to change the cutoff from a + + At some point, it may be desirable to change the cutoff from a single hard cutoff that destroys the circuit to a soft cutoff and a hard cutoff, where the soft cutoff merely triggers the building - of a new circuit, and the hard cutoff triggers destruction of the + of a new circuit, and the hard cutoff triggers destruction of the circuit. - Good values for hard and soft cutoffs seem to be 80% and 60% + Good values for hard and soft cutoffs seem to be 80% and 60% respectively, but we should eventually justify this with observation. When to Begin Calculation - The number of circuits to observe (NCIRCUITS_TO_CUTOFF) before - changing the CircuitBuildTimeout will be tunable via a #define. From - our measurements, a good value for NCIRCUITS_TO_CUTOFF appears to be + The number of circuits to observe (NCIRCUITS_TO_CUTOFF) before + changing the CircuitBuildTimeout will be tunable via a #define. From + our measurements, a good value for NCIRCUITS_TO_CUTOFF appears to be on the order of 100. Dealing with Timeouts - Timeouts should be counted as the expectation of the region of + Timeouts should be counted as the expectation of the region of of the Pareto distribution beyond the cutoff. The proposal will be updated with this value soon. - Also, in the event of network failure, the observation mechanism + Also, in the event of network failure, the observation mechanism should stop collecting timeout data. Client Hints Some research still needs to be done to provide initial values for CircuitBuildTimeout based on values learned from modem - users, DSL users, Cable Modem users, and dedicated links. A + users, DSL users, Cable Modem users, and dedicated links. A radiobutton in Vidalia should eventually be provided that - sets CircuitBuildTimeout to one of these values and also + sets CircuitBuildTimeout to one of these values and also provide the option of purging all learned data, should any exist. These values can either be published in the directory, or shipped hardcoded for a particular Tor version. - + Issues Impact on anonymity diff --git a/src/or/circuitbuild.c b/src/or/circuitbuild.c index 0a55d8f86a..39e5a43f5e 100644 --- a/src/or/circuitbuild.c +++ b/src/or/circuitbuild.c @@ -308,7 +308,7 @@ circuit_build_times_parse_state(circuit_build_times_t *cbt, SPLIT_SKIP_SPACE|SPLIT_IGNORE_BLANK, 0); if (smartlist_len(args) < 2) { *msg = tor_strdup("Unable to parse circuit build times: " - "Too few arguments to CircuitBuildTIme"); + "Too few arguments to CircuitBuildTime"); break; } else { const char *ms_str = smartlist_get(args,0); @@ -375,7 +375,7 @@ circuit_build_times_update_alpha(circuit_build_times_t *cbt) } if (n!=cbt->total_build_times) { - log_err(LD_CIRC, "Discrepency in build times count: %d vs %d", n, + log_err(LD_CIRC, "Discrepancy in build times count: %d vs %d", n, cbt->total_build_times); } tor_assert(n==cbt->total_build_times); @@ -396,7 +396,8 @@ circuit_build_times_update_alpha(circuit_build_times_t *cbt) * * See http://en.wikipedia.org/wiki/Quantile_function, * http://en.wikipedia.org/wiki/Inverse_transform_sampling and - * http://en.wikipedia.org/wiki/Pareto_distribution#Generating_a_random_sample_from_Pareto_distribution + * http://en.wikipedia.org/wiki/Pareto_distribution#Generating_a_ + * random_sample_from_Pareto_distribution * That's right. I'll cite wikipedia all day long. */ double @@ -551,7 +552,7 @@ circuit_build_times_check_too_many_timeouts(circuit_build_times_t *cbt) } timeout_rate /= RECENT_CIRCUITS; - /* If more then 80% of our recent circuits are timing out, + /* If more than 80% of our recent circuits are timing out, * we need to re-estimate a new initial alpha and timeout */ if (timeout_rate < MAX_RECENT_TIMEOUT_RATE) { return 0; |