|
|
 |
 |
 |
 |
 |
top
| File
formats for the passive-start measurements |
 |
 |
| detailed description |
 |
Quick links to passive-start
measurements' file types:
P-1 | P-2 | P-3
| P-4 | P-5
P-6 | P-7 | P-8
| P-9 | P-10
P-11 | P-12 |
P-13 | P-14 |
P-15
P-16 | P-17 |
|
During the passive-start measurements,
we have successfully tracked the top 750 files from Pirate
Bay (for more details on this read the measurements
page). Files were tracked from separate nodes in PlanetLab.
For each PlanetLab node, we created a number of timestamped
archives, each with the following structure:
- Directory base_data/
Info:
Used to store various files regarding general monitoring
process
- trackinginfo files [File
type P-1]
Info:
Used to store information about the peers contacting
the passive-start infrastructure.
Format:
TimeStamp
<tab> <IP> <tab>
BatchNo <tab> FileName
<tab> IP
<tab> Port
<tab> Unique
or
TimeStamp
<tab> <E-IP> <tab>
BatchNo <tab> RootIP
where:
- TimeStamp is
the time stamp of the observation;
- <IP> is
a constant string (IP), defining that the line
reports information about an IP;
- BatchNo is an
integer identifiying the set of measurements to
which the IP belongs; unique per machine;
- FileName is
the monitored (torrent) file to which this information
refers to;
- IP is the anonymized
IP address of the observed peer;
- Port is the
port used by the contacting peer;
- Unique reports
whether the peer was previously tracked (possible
values are Y and N);
- <E-IP>
is a constant string (E-IP), defining that the
line reports information about a peer tracking;
- RootIP shows
the IP to which the multi-source traceroutes were
started (the tracking target), and should match
the IP addresses identified in the previous <IP>
lines.
- trackingstats files [File
type P-2]
Info:
Used to store information about the tracking process.
Format:
TimeStamp
<sp> <BATCH> <sp>
BatchNo <sp> NBatchIPs
<sp> NUniqueBatchIPs
or
TimeStamp
<sp> <EDGES> <sp>
BatchNo <sp> NoDestinations
<sp> NPackets
where:
- TimeStamp is
the time stamp of the recording;
- <BATCH>
is a constant string (BATCH), defining that the
line reports information about a batch of IP addresses
to be tracked;
- BatchNo is an
integer identifiying the set of measurements to
which the recording belongs; unique per machine;
- NBatchIPs is
the number of IP addresses in this batch;
- NUniqueBatchIPs
is the number of IP addresses to be monitored
(torrent) in this batch;
- <EDGES>
is a constant string (EDGES), defining that the
line reports information about the tracking process
related to a certain batch of IP addresses;
- NoDestinations
shows the number of IP addresses to which the
multi-source traceroutes were started, and should
match the number identified in the previous <BATCH>
line;
- NoPackets shows
the number of packets used to track the batch
of IP addresses specified in the current line.
- destinations.dat files
[File type P-3]
Info:
Used to store the tracked IP addresses.
Format:
IP
One IP address per line.
- edges_cache.dat files
[File type P-4]
Info:
Used to store visited paths.
Format:
ExtIP1<sp>
ExtIP2
Two IP addresses per line, except that some IP addresses
may be concatenated with extra '+' signs, which signal
unresponsive hops in the traceroute process (see Scriptroute's
Reverse Path Tree script for a more detailed description
of the '+' sign's significance).
- sources.dat files [File
type P-5]
Info:
Used to store the IP addresses of the PlanetLab multi-traceroute
sources. Only a limited number of these sources were
used in a single multi-source traceroute (we did not
trace from all sources to all peers, but from a fixe
number of randomly selected sources to all peers).
Format:
IP
One IP address per line.
- Directory torrent_data/
Info:
Used to store various files regarding per-file monitoring
process.
- buffer.err, buffer.res
files [File types P-6,
P-7]
Info:
Used to store information about the IP addresses buffer
[debugging information].
- Torrent<ID>-LP.err,Torrent<ID>-LP.res
files [File types P-8,
P-9]
Info:
Used to store information about tracking a file (<ID>
is the tracked file's ID, given as an 8-digit number,
e.g., 0000000).
Format:
The err file should be empty;
otherwise, it contains a detailed error report.
The res file has the following
content:
# Comment line
or
TimeStamp
<sp> IP <sp>
Port <sp> FileName
<sp> <[connection]>
where:
- TimeStamp is
the time stamp of the recording;
- IP is the IP
addresses of the contacting peer;
- Port is the
TCP port on which the contacting peer listens;
- FileName is
the name of the torrent for which this connection
was issued;
- <[connection]>
is a constant string ([connection]),
defining that the line reports information about
an incoming connection.
- tracker.err,tracker.res
files [File types P-10,
P-11]
Info:
Used to store debugging information. Should be empty.
- Directory batch<ID>data/
Info:
Used to store per-batch information regarding the tracking
process. <ID> represents
the batch identification number, unique per PlanetLab
node.
- ipaddresses file [File
type P-12]
Info:
Used to store information about the IP addresses to
be tracked in this batch.
Format:
IP
One IP per line (typically <100).
- measure_edges.err,measure_edges.out
files [File types P-13,
P-14]
Info:
Used to store information about multi-source tracerouting
process. Useful only for the authors of Scriptroute.
- new_edges_cache.dat file
[File type P-15]
Info:
Used to store information about the edges newly discovered
in the multi-source tracerouting process. Same structure
as file type P-4.
- Sub-directory paths/
Info:
Used to store detailed information about the multi-source
traceroutes.
- path_<SrcIP>_<DstIP>
files [File types P-16]
Info:
Used to store detailed information about the SrcIP
path to DstIP hops.
Format:
HopNo <sp>
IP <sp> Time
<ms>
or
<packetcount:>
<sp> NoPackets
or
Error
where:
- HopNo is
the number of hop observed in the path;
- IP is the
IP addresses of the identified path in the
hop, or unresponsive,
if the hop was unresponsive;
- Time is
the latency of the packets towards the identified
hop;
- NoPackets
is the number of packets used to identify
all the path's hops.
- sources-<DstIP>.dat
files [File types P-17]
Info:
Used to store information about the source IP
addresses in the multi-source traceroute towards
a destination IP address (<DstIP>).
Format:
IP
One IP address per line.
|
|
 |
|
 |
|