diff --git a/README.md b/README.md index 0f3a40f489e1a81ddf4b53c0b6576f4ec10acf80..e9f0d636d73e7d22934c1086eb357cca0c8411d6 100644 --- a/README.md +++ b/README.md @@ -6,7 +6,7 @@ These predicted questions (or queries) are then appended to the original documen ## docTTTTTquery (with T5) -Models and data from [`https://github.com/castorini/docTTTTTquery/`](https://github.com/castorini/docTTTTTquery/): +Models and data from [`https://github.com/castorini/docTTTTTquery/`](https://github.com/castorini/docTTTTTquery/) for passage ranking: File | Size | MD5 | Download :----|-----:|:----|:----- @@ -19,3 +19,10 @@ File | Size | MD5 | Download `t5-base.zip` | 357 MB | `881d3ca87c307b3eac05fae855c79014` | [[Download](https://git.uwaterloo.ca/jimmylin/doc2query-data/raw/master/T5-passage/t5-base.zip)] `t5-large.zip` | 1.2 GB | `21c7e625210b0ae872679bc36ed92d44` | [[Download](https://git.uwaterloo.ca/jimmylin/doc2query-data/raw/master/T5-passage/t5-large.zip)] +Models and data from [`https://github.com/castorini/docTTTTTquery/`](https://github.com/castorini/docTTTTTquery/) for document ranking: + +File | Size | MD5 | Download +:----|-----:|:----|:----- +`msmarco-docs.tsv.gz` | 7.9 GB | `103b19e21ad324d8a5f1ab562425c0b4` | [[Download](https://git.uwaterloo.ca/jimmylin/doc2query-data/raw/master/T5-doc/msmarco-docs.tsv.gz)] +`predicted_queries_doc.tar.gz` | 2.2 GB | `4967214dfffbd33722837533c838143d` | [[Download](https://git.uwaterloo.ca/jimmylin/doc2query-data/raw/master/T5-doc/predicted_queries_doc.tar.gz)] +`segment_doc_ids.txt` | 170 MB | `82c00bebab0d98c1dc07d78fac3d8b8d` | [[Download](https://git.uwaterloo.ca/jimmylin/doc2query-data/raw/master/T5-doc/segment_doc_ids.txt)]