From 40ae2f1ffd4787d46f92358daa15d7e172997e82 Mon Sep 17 00:00:00 2001
From: Jimmy Lin <jimmylin@uwaterloo.ca>
Date: Wed, 11 Nov 2020 12:24:29 -0500
Subject: [PATCH] Update README.md

---
 README.md | 9 ++++++++-
 1 file changed, 8 insertions(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 0f3a40f..e9f0d63 100644
--- a/README.md
+++ b/README.md
@@ -6,7 +6,7 @@ These predicted questions (or queries) are then appended to the original documen
 
 ## docTTTTTquery (with T5)
 
-Models and data from [`https://github.com/castorini/docTTTTTquery/`](https://github.com/castorini/docTTTTTquery/):
+Models and data from [`https://github.com/castorini/docTTTTTquery/`](https://github.com/castorini/docTTTTTquery/) for passage ranking:
 
 File | Size | MD5 | Download
 :----|-----:|:----|:-----
@@ -19,3 +19,10 @@ File | Size | MD5 | Download
 `t5-base.zip` | 357 MB | `881d3ca87c307b3eac05fae855c79014` | [[Download](https://git.uwaterloo.ca/jimmylin/doc2query-data/raw/master/T5-passage/t5-base.zip)]
 `t5-large.zip` | 1.2 GB | `21c7e625210b0ae872679bc36ed92d44` | [[Download](https://git.uwaterloo.ca/jimmylin/doc2query-data/raw/master/T5-passage/t5-large.zip)]
 
+Models and data from [`https://github.com/castorini/docTTTTTquery/`](https://github.com/castorini/docTTTTTquery/) for document ranking:
+
+File | Size | MD5 | Download
+:----|-----:|:----|:-----
+`msmarco-docs.tsv.gz` | 7.9 GB | `103b19e21ad324d8a5f1ab562425c0b4` | [[Download](https://git.uwaterloo.ca/jimmylin/doc2query-data/raw/master/T5-doc/msmarco-docs.tsv.gz)]
+`predicted_queries_doc.tar.gz` | 2.2 GB | `4967214dfffbd33722837533c838143d` | [[Download](https://git.uwaterloo.ca/jimmylin/doc2query-data/raw/master/T5-doc/predicted_queries_doc.tar.gz)]
+`segment_doc_ids.txt` | 170 MB | `82c00bebab0d98c1dc07d78fac3d8b8d` | [[Download](https://git.uwaterloo.ca/jimmylin/doc2query-data/raw/master/T5-doc/segment_doc_ids.txt)]
-- 
GitLab