| URISource {tm} | R Documentation |
Constructs a source which represents documents located by a uniform resource identifier.
URISource(x, encoding = "unknown")
x |
A vector of Uniform Resource Identifier, i.e., either a character identifying the file or a connection. |
encoding |
encoding to be assumed for input strings. It is used to mark character strings as known to be in Latin-1 or UTF-8: it is not used to re-encode the input. |
An object of class URISource which extends the class
Source representing documents located by a URI.
Ingo Feinerer
DirSource for accessing a directory, and
getSources to list available sources.
Encoding on encodings in R.
loremipsum <- system.file("texts", "loremipsum.txt", package = "tm")
ovid <- system.file("texts", "txt", "ovid_1.txt", package = "tm")
us <- URISource(c(loremipsum, ovid))
inspect(Corpus(us))