Getting Started Guide

This package is a work-in-progress tool for expanding search queries by adding domain-relevant keywords drawn from a sample text. The primary function, automate_keywords(), also returns metadata about each keyword (word counts, part-of-speech tags, and named-entity tags) to assist with keyword selection.

The package currently supports query expansion in English only. Support for additional languages is planned for later iterations.

Install rKeywords in R from GitHub

devtools::install_github('seankellyhp/rKeywords')

Install and configure spacyr using the following guide.

See https://spacyr.quanteda.io/ for Miniconda and spacyr installation instructions.

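library("spacyr")
spacy_install()                             # One-time setup: install spaCy
spacy_download_langmodel("en_core_web_lg")  # Optional: download the large English model
spacy_initialize()                          # Start spaCy for the current R session
spacy_initialize(save_profile = TRUE)       # Optional: run instead of the line above to save the spaCy settings for future sessions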

Automatically discover new keywords using query expansion

# library() stops with an error if a package is missing, whereas require() only returns FALSE
library(rKeywords, quietly = TRUE)
library(dplyr, quietly = TRUE)
library(quanteda, quietly = TRUE)
library(stringr, quietly = TRUE)

Input starting search string

rawString <- "Immigrant* OR migrant* OR asylum seeker* OR visa*"
seedWords <- convert_bool(rawString)
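
To make the structure of the search string concrete, the sketch below splits a Boolean string on OR and strips the trailing wildcards using base R only. It is an illustration of the kind of term list a seed query contains, not the implementation of convert_bool(), whose actual output format may differ. The helper name split_bool_sketch is hypothetical and not part of rKeywords.

```r
# Hypothetical helper for illustration only; not part of rKeywords.
split_bool_sketch <- function(query) {
  # Split on the OR operator, allowing any amount of surrounding whitespace
  terms <- strsplit(query, "\\s+OR\\s+")[[1]]
  # Drop trailing wildcard characters and any leftover whitespace
  terms <- trimws(sub("\\*+$", "", trimws(terms)))
  # Lower-case so that "Immigrant*" and "immigrant*" map to the same term
  tolower(terms)
}

sketchTerms <- split_bool_sketch("Immigrant* OR migrant* OR asylum seeker* OR visa*")
print(sketchTerms)
```

Multi-word terms such as "asylum seeker" stay intact because the split happens only on the OR operator, not on every space.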

Load sample corpus

The corpus should be a sample of domain-relevant text in quanteda corpus format.

rawCorp <- readRDS("data/uk_eng_corp_sample.rds")
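
If no prepared .rds file is at hand, a quanteda corpus can be built from a character vector and saved in the same way. The following is a minimal sketch: the two example texts and the output file name are made up for illustration, and a real sample should contain many more domain-relevant documents.

```r
library(quanteda)

# Placeholder texts, invented for illustration only
texts <- c(
  doc1 = "The government announced new visa rules for migrant workers.",
  doc2 = "Asylum seekers waited months for a decision on their applications."
)

# corpus() turns a named character vector into a quanteda corpus
sampleCorp <- corpus(texts)
ndoc(sampleCorp)  # number of documents in the corpus

# Save to a temporary file and read it back, mirroring the readRDS() call above
corpPath <- file.path(tempdir(), "my_corp_sample.rds")
saveRDS(sampleCorp, corpPath)
reloaded <- readRDS(corpPath)
```

The reloaded object can then be passed wherever rawCorp is used.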

Path to pre-trained GloVe word embedding model

modelPath <- 'models/glove.6B.300d.txt'
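
The file name suggests the 300-dimensional vectors from the GloVe 6B release, which are distributed as plain text with one token per line followed by its space-separated vector values. Assuming that format, a quick base-R check of the first line can catch a wrong path or a truncated download before the longer expansion step runs. The helper name check_glove_dim is hypothetical and not part of rKeywords.

```r
# Hypothetical helper for illustration only; not part of rKeywords.
# Returns the vector dimension implied by the first line of a GloVe text file.
check_glove_dim <- function(path) {
  if (!file.exists(path)) {
    stop("Embedding file not found: ", path)
  }
  firstLine <- readLines(path, n = 1, warn = FALSE)
  fields <- strsplit(firstLine, " ", fixed = TRUE)[[1]]
  # The first field is the token; the remaining fields are the vector values
  length(fields) - 1L
}

# Demonstration on a tiny stand-in file with 3-dimensional vectors
demoPath <- tempfile(fileext = ".txt")
writeLines(c("the 0.1 0.2 0.3", "visa 0.4 0.5 0.6"), demoPath)
demoDim <- check_glove_dim(demoPath)
print(demoDim)

# With the real model file, the result would be expected to be 300:
# check_glove_dim("models/glove.6B.300d.txt")
```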

Expand keywords

keywordsNew <- automate_keywords(seedWords = seedWords,
                                 corpus = rawCorp,
                                 modelPath = modelPath,
                                 nCandidates = 200)

queryNew <- create_query(keywordsNew, n = 40, type = "regex")
print(queryNew)
