-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathDESCRIPTION
More file actions
42 lines (42 loc) · 1.35 KB
/
DESCRIPTION
File metadata and controls
42 lines (42 loc) · 1.35 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
Package: textpress
Type: Package
Title: A Lightweight and Versatile NLP Toolkit
Version: 1.1.0
Authors@R: c(person("Jason", "Timm", role = c("aut", "cre"), email = "JaTimm@salud.unm.edu"))
Maintainer: Jason Timm <JaTimm@salud.unm.edu>
Description: A rugged, practical toolkit for web scraping, stepwise NLP
pipelines, and lightweight LLM context engineering. The API follows a
consistent, predictable flow organized around four core actions:
fetching, reading, processing, and searching. Functions cover the full
pipeline from raw web data acquisition to structural text processing and
BM25 indexing. Multiple retrieval strategies are supported, including
regex, dictionary matching, and ranked keyword search. Designed to be
transparent and unobtrusive: pipe-friendly, no heavy dependencies, and
all outputs are plain data frames or data.tables rather than complex
object classes.
License: MIT + file LICENSE
Encoding: UTF-8
LazyData: false
Depends:
R (>= 3.5)
Imports:
data.table,
httr,
Matrix,
rvest,
stringi,
stringr,
xml2,
pbapply,
jsonlite,
lubridate
Suggests:
SnowballC (>= 0.7.0),
DT,
dplyr,
ellmer,
knitr,
rmarkdown
RoxygenNote: 7.3.3
URL: https://github.com/jaytimm/textpress, https://jaytimm.github.io/textpress/
BugReports: https://github.com/jaytimm/textpress/issues