WARC File

We are using request, response and metadata WARC types. In the metadata section we convey additional information about the Memento response, such as the period when the digest of the page bitestream was the same (start, end) and number of hits during that period. As you can see below,the WARC-Concurrent-To: parameter is used to connect the response record with the corresponding metadata record.

WARC/1.0
WARC-Type: warcinfo
WARC-Date: 2012-05-04T17:50:32Z
WARC-Filename: MEM-20120504175032789-00000-~~.warc.gz
WARC-Record-ID: <urn:uuid:6c9458de-88ff-4042-8bac-3f7088259637>
Content-Type: application/warc-fields
Content-Length: 124

format:WARC File Format 1.0
conformsTo:http://bibnum.bnf.fr/WARC/WARC_ISO_28500_version1_latestdraft.pdf
operator:tr-achive


WARC/1.0
WARC-Type: request
WARC-Target-URI: http://dans.knaw.nl/nieuws/rss.xml
WARC-Date: 2012-04-29T00:00:30Z
WARC-Concurrent-To: <urn:uuid:7e2b792d-3048-4c72-9b3e-c32db0e30091>
WARC-Record-ID: <urn:uuid:a3408fc2-a0eb-4e2a-b2c8-5bda5d2f36dd>
Content-Type: application/http; msgtype=request
Content-Length: 336

GET /nieuws/rss.xml HTTP/1.1
User-Agent:Feedfetcher-Google; (+http://www.google.com/feedfetcher.html; 12 subscribers; feed-id=4533563086221712840)
Accept:*/*
Accept-Encoding:gzip,deflate
Connection:Keep-alive
Host:www.dans.knaw.nl
If-Modified-Since:Sat, 28 Apr 2012 22:16:44 GMT
If-None-Match:"6df79016d88fe9071b56e0cb16275936"


WARC/1.0
WARC-Type: metadata
WARC-Target-URI: http://dans.knaw.nl/nieuws/rss.xml
WARC-Date: 2012-04-29T00:00:30Z
WARC-Concurrent-To: <urn:uuid:7e2b792d-3048-4c72-9b3e-c32db0e30091>
WARC-Record-ID: <urn:uuid:ef2e657d-aff5-4ff5-93d0-81a8f6647bd2>
Content-Type: application/warc-fields
Content-Length: 107

start:2012-04-29T00:00:30Z
end:2012-04-29T02:42:06Z
digest:RUN3SNH35XDGQAXRWAA654YDHJT7AL7K
numberOfHits:8


WARC/1.0
WARC-Type: response
WARC-Target-URI: http://dans.knaw.nl/nieuws/rss.xml
WARC-Date: 2012-04-29T00:00:30Z
WARC-Payload-Digest: sha1:RUN3SNH35XDGQAXRWAA654YDHJT7AL7K
WARC-Record-ID: <urn:uuid:7e2b792d-3048-4c72-9b3e-c32db0e30091>
Content-Type: application/http; msgtype=response
Content-Length: 21878

HTTP/1.1 200 OK
Date: Sun, 29 Apr 2012 00:00:30 GMT
Server: Apache/2.2.3 (Red Hat)
Server: Apache/2.2.3 (Red Hat)
X-Powered-By: PHP/5.1.6
Set-Cookie: SESS4a0091108fb271e05f34da7cf77c975f=l7v3v251rf4ck4e7e6a87vmkj0; expires=Tue, 22 May 2012 03:33:50 GMT; path=/
Last-Modified: Sat, 28 Apr 2012 23:18:30 GMT
ETag: "dde9afaf0dc83c5c1e944b349f5f7c1a"
Expires: Sun, 19 Nov 1978 05:00:00 GMT
Cache-Control: must-revalidate
Connection: close
Transfer-Encoding: chunked
Content-Type: application/rss+xml; charset=utf-8

<?xml version="1.0" encoding="utf-8" ?><rss version="2.0" xml:base="http://www.dans.knaw.nl/nieuws/rss.xml" xmlns:dc="http://purl
.org/dc/elements/1.1/">
  <channel>
...