readReut21578XML {tm}R Documentation

Read In a Reuters-21578 XML Document

Description

Read in a Reuters-21578 XML document.

Usage

readReut21578XML(elem, language, id)
readReut21578XMLasPlain(elem, language, id)

Arguments

elem

a list with the named component content which must hold the document to be read in.

language

a string giving the text's language.

id

a unique identification string for the returned text document.

Value

A Reuters21578Document for readReut21578XML, or a PlainTextDocument for readReut21578XMLasPlain.

Author(s)

Ingo Feinerer

References

Lewis, David (1997) Reuters-21578 Text Categorization Collection Distribution 1.0. http://kdd.ics.uci.edu/databases/reuters21578/reuters21578.html

Luz, Saturnino XML-encoded version of Reuters-21578. http://modnlp.berlios.de/reuters21578.html

See Also

getReaders to list available reader functions.


[Package tm version 0.5-10 Index]