To create the dataset from offsets, use the following commands:
```bash
conda create -yn queryner python=3.8
conda activate queryner
pip install -r requirements.txt
./prepare_dataset.sh
```
The resulting train/dev/test split files will be in the `queryner_data` directory.
To create the files with individual annotations from each of the three annotators on the test set, run:

```bash
./assemble_individual_annotators.sh
```
The resulting annotation files will be in the `individual_annotations` directory.
These scripts download the original raw queries from the Amazon ESCI dataset and apply the QueryNER offsets to generate data in BIO CoNLL-style format.
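The conversion scripts above define the exact offset file format; as a minimal illustrative sketch (not the repository's implementation), converting character-offset annotations to BIO tags for a whitespace-tokenized query might look like this, with hypothetical labels:

```python
from typing import List, Tuple

def offsets_to_bio(query: str, spans: List[Tuple[int, int, str]]) -> List[Tuple[str, str]]:
    """Convert character-offset annotations (start, end, label) into BIO tags
    for a whitespace-tokenized query string."""
    # Record each token with its character start and end positions.
    tokens = []
    pos = 0
    for tok in query.split():
        start = query.index(tok, pos)
        tokens.append((tok, start, start + len(tok)))
        pos = start + len(tok)

    tagged = []
    for tok, tok_start, tok_end in tokens:
        tag = "O"
        for span_start, span_end, label in spans:
            # Token falls inside an annotated span: B- if it opens the span, else I-.
            if tok_start >= span_start and tok_end <= span_end:
                tag = ("B-" if tok_start == span_start else "I-") + label
                break
        tagged.append((tok, tag))
    return tagged

# Example query with two annotated spans (labels are illustrative only).
print(offsets_to_bio("red running shoes",
                     [(0, 3, "modifier"), (4, 17, "core_product_type")]))
# [('red', 'B-modifier'), ('running', 'B-core_product_type'), ('shoes', 'I-core_product_type')]
```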
The dataset and models are also accessible on 🤗 HuggingFace:
- huggingface.co/datasets/bltlab/queryner
- huggingface.co/bltlab/queryner-bert-base-uncased
- huggingface.co/bltlab/queryner-augmented-data-bert-base-uncased
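For example, the dataset and a fine-tuned model can be loaded with the `datasets` and `transformers` libraries. This is a sketch assuming the Hub copies load directly with `load_dataset` and that the model checkpoints are token-classification models:

```python
from datasets import load_dataset
from transformers import AutoTokenizer, AutoModelForTokenClassification

# Load the QueryNER dataset and a fine-tuned BERT model from the HuggingFace Hub.
dataset = load_dataset("bltlab/queryner")
tokenizer = AutoTokenizer.from_pretrained("bltlab/queryner-bert-base-uncased")
model = AutoModelForTokenClassification.from_pretrained("bltlab/queryner-bert-base-uncased")

print(dataset)
print(model.config.id2label)
```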
If you use the dataset or models, please cite our paper:
```bibtex
@misc{palenmichel2024queryner,
  title={QueryNER: Segmentation of E-commerce Queries},
  author={Chester Palen-Michel and Lizzie Liang and Zhe Wu and Constantine Lignos},
  year={2024},
  eprint={2405.09507},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}
```