程序包开发：如何从程序包导入数据，对其进行转换并将rexport作为数据集？

泰勒·林克

使用roxygen2框架，如何从另一个包中导入数据集，进行更改，然后将数据集重新导出为自己包中的数据集？

以我在导出数据集方面的经验，可以通过保存.rda文件（通常使用该save函数）来手动完成此过程。我想使其更具动态性，因此，当人们更新依赖项包时，如果另一个包更新了数据集，则我的包将相应地更新其数据集。

例如，假设我stop_words要从tidytext导入数据集，删除SMART类型lexicon并重新导出为stop_words2。有没有办法做到这一点？我将知道此解决方案何时data(package = 'MyPackage')可以显示重新导出的数据集。

我的尝试无效（data(package =即使可访问数据也无效）：

#' Various lexicons for English stop words
#'
#' English stop words from three lexicons, as a data frame.
#' The onix sets are pulled from the tm package. Note
#' that words with non-ASCII characters have been removed.  THis
#' is a reimport from the \pkg{tidytext} package's \code{stop_words}
#' data set but with the SMART lexicon filtered out.
#'
#' @format A data frame with 578 rows and 2 variables:
#' \describe{
#'  \item{word}{An English word}
#'  \item{lexicon}{The source of the stop word. Either "onix" or "snowball"}
#'  }
#' @usage data(sam_i_am2)
#' @export
stop_words2 <- tidytext::stop_words[tidytext::stop_words[['lexicon']] != 'SMART', ]

肯·贝努瓦

我认为这是不可能的，因为data()仅在子目录data/中进行搜索，而子目录不在重新导出数据对象的位置。

但是，如果放弃此目标，则仍然可以像访问“延迟加载”数据集一样访问新数据对象。但请注意，使用不能正常工作data(stop_words2, package = "MyPackage")。

#' Various lexicons for English stop words
#'
#' English stop words from three lexicons, as a data frame. The onix sets are
#' pulled from the tm package. Note that words with non-ASCII characters have
#' been removed.  This is a reimport from the \pkg{tidytext} package's
#' \code{stop_words} data set but with the SMART lexicon filtered out.
#' @inherit tidytext::stop_words title description source references
#' @export
stop_words2 <- tidytext::stop_words[tidytext::stop_words[["lexicon"]] != "SMART", ]

请注意roxygen2对回收原始文档组件的使用。

考虑使用stopwords软件包，该软件包包含SMART单词以及更多内容。

本文收集自互联网，转载请注明来源。

如有侵权，请联系 [email protected] 删除。

编辑于 2020-11-5

我来说两句

0 条评论

登录后参与评论

TOP 榜单

文章

程序包开发：如何从程序包导入数据，对其进行转换并将rexport作为数据集？

程序包开发：如何从程序包导入数据，对其进行转换并将rexport作为数据集？

IE 11中的FormData未定义

如何一次从多个文本框中获取值？

在 Python 2.7 中。如何从文件中读取特定文本并分配给变量

OpenCv：改变 putText() 的位置

Redux动作正常，但减速器无效

如何从JavaScript中的MP3文件读取元数据属性？

如何使用Redux-Toolkit重置Redux Store

将加号/减号添加到jQuery菜单

OpenGL纹理格式的颜色错误

获取并汇总所有关联的数据

超过时间限制错误C ++

ActiveModelSerializer仅显示关联的ID

在交互式Python Shell中获得最后结果

如何开始为Ubuntu开发

去噪自动编码器和常规自动编码器有什么区别？

Excel 2016图表将增长与4个参数进行比较

算术中的c ++常量类型转换

使用因子时如何在y轴上的ggplot中插入count或％

TreeMap中的自定义排序

如何在R中转置数据

在 React Native Expo 中使用 react-redux 更改另一个键的值