URISource {tm}R Documentation

Uniform Resource Identifier Source

Description

Constructs a source which represents documents located by a uniform resource identifier.

Usage

URISource(x, encoding = "unknown")

Arguments

x

A vector of Uniform Resource Identifier, i.e., either a character identifying the file or a connection.

encoding

encoding to be assumed for input strings. It is used to mark character strings as known to be in Latin-1 or UTF-8: it is not used to re-encode the input.

Value

An object of class URISource which extends the class Source representing documents located by a URI.

Author(s)

Ingo Feinerer

See Also

DirSource for accessing a directory, and getSources to list available sources. Encoding on encodings in R.

Examples

loremipsum <- system.file("texts", "loremipsum.txt", package = "tm")
ovid <- system.file("texts", "txt", "ovid_1.txt", package = "tm")
us <- URISource(c(loremipsum, ovid))
inspect(Corpus(us))

[Package tm version 0.5-10 Index]