RCV1Document {tm}R Documentation

RCV1 Text Document

Description

Construct an object representing a RCV1 XML text document with meta information.

Usage

RCV1Document(x, author = character(0),
             datetimestamp = as.POSIXlt(Sys.time(), tz = "GMT"),
             description = character(0), heading = character(0),
             id = character(0), origin = character(0),
             language = character(0), localmetadata = list())

Arguments

x

Object of class list containing the content.

author

Object of class character containing the author names.

datetimestamp

Object of class POSIXlt containing the date and time when the document was written.

description

Object of class character containing additional text information.

heading

Object of class character containing the title or a short heading.

id

Object of class character containing an identifier.

origin

Object of class character containing information on the source and origin of the text.

language

Object of class character containing the language of the text (preferably as IETF language tags).

localmetadata

Object of class list containing local meta data in form of tag-value pairs.

Author(s)

Ingo Feinerer

References

Lewis, D. D.; Yang, Y.; Rose, T.; and Li, F (2004). RCV1: A New Benchmark Collection for Text Categorization Research. Journal of Machine Learning Research, 5, 361–397. http://www.jmlr.org/papers/volume5/lewis04a/lewis04a.pdf

See Also

PlainTextDocument and Reuters21578Document


[Package tm version 0.5-10 Index]