Metadata-Version: 2.1
Name: pyserini
Version: 0.8.1.0
Summary: Python interface to the Anserini IR toolkit built on Lucene
Home-page: https://github.com/castorini/pyserini
Author: Jimmy Lin
Author-email: jimmylin@uwaterloo.ca
License: UNKNOWN
Description: Pyserini provides a simple Python interface to the [Anserini](http://anserini.io/) IR toolkit via [pyjnius](https://github.com/kivy/pyjnius).
        
        ## Installation
        
        Install via PyPI
        
        ```
        pip install pyserini
        ```
        
        ## Usage
        
        Here's a sample pre-built index on TREC Disks 4 &amp; 5 to play with (used in the [TREC 2004 Robust Track](https://github.com/castorini/anserini/blob/master/docs/regressions-robust04.md)):
        
        ```bash
        wget https://git.uwaterloo.ca/jimmylin/anserini-indexes/raw/master/index-robust04-20191213.tar.gz
        tar xvfz index-robust04-20191213.tar.gz
        ```
        
        Use the `SimpleSearcher` for searching:
        
        ```python
        from pyserini.search import pysearch
        
        searcher = pysearch.SimpleSearcher('index-robust04-20191213/')
        hits = searcher.search('hubble space telescope')
        
        # Print the first 10 hits:
        for i in range(0, 10):
            print(f'{i+1} {hits[i].docid} {hits[i].score}')
        
        # Grab the actual text:
        hits[0].content
        ```
        
        For additional information, please refer to the [Pyserini repository](https://github.com/castorini/pyserini/).
        
Platform: UNKNOWN
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: Apache Software License
Classifier: Operating System :: OS Independent
Requires-Python: >=3.6
Description-Content-Type: text/markdown
