Author: simonetripodi
Date: Fri Jan 13 14:46:37 2012
New Revision: 1231099
URL: http://svn.apache.org/viewvc?rev=1231099&view=rev
Log:
s/Any23/Apache Any23
Modified:
incubator/any23/trunk/src/site/apt/any23-plugins.apt
incubator/any23/trunk/src/site/apt/configuration.apt
incubator/any23/trunk/src/site/apt/dev-csv-extractor.apt
incubator/any23/trunk/src/site/apt/dev-data-conversion.apt
incubator/any23/trunk/src/site/apt/dev-data-extraction.apt
incubator/any23/trunk/src/site/apt/dev-microdata-extractor.apt
incubator/any23/trunk/src/site/apt/dev-microformat-extractors.apt
incubator/any23/trunk/src/site/apt/dev-validation-fix.apt
incubator/any23/trunk/src/site/apt/developers.apt
incubator/any23/trunk/src/site/apt/extractors.apt
incubator/any23/trunk/src/site/apt/getting-started.apt
incubator/any23/trunk/src/site/apt/index.apt
incubator/any23/trunk/src/site/apt/install.apt
incubator/any23/trunk/src/site/apt/service.apt
incubator/any23/trunk/src/site/apt/supported-formats.apt
Modified: incubator/any23/trunk/src/site/apt/any23-plugins.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/any23-plugins.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/any23-plugins.apt (original)
+++ incubator/any23/trunk/src/site/apt/any23-plugins.apt Fri Jan 13 14:46:37
2012
@@ -1,10 +1,10 @@
-Any23 Plugins
+Apache Any23 Plugins
* Introduction
- This section describes the <Any23> plugins support.
+ This section describes the <Apache Any23> plugins support.
- <Any23> comes with a set of predefined plugins.
+ <Apache Any23> comes with a set of predefined plugins.
Such plugins are located under the <any23-root>/<<plugins>> dir.
A plugin is a standard <Maven3> module containing any implementation of
@@ -15,9 +15,9 @@ Any23 Plugins
* How to Register a Plugin
- A plugin can be added to <Any23> by:
+ A plugin can be added to <Apache Any23> by:
- * adding its <JAR> to the <Any23> <JVM classpath>;
+ * adding its <JAR> to the <Apache Any23> <JVM classpath>;
* adding its <JAR> to the <$HOME/.any23/plugins> directory.
@@ -25,7 +25,7 @@ Any23 Plugins
TODO: plugin support in library
- TODO: plugin support in Any23 Service
+ TODO: plugin support in Apache Any23 Service
Any implementation of <ExtractorPlugin> will automatically registered to
the
{{{./xref/org/apache/any23/extractor/ExtractorRegistry.html}ExtractorRegistry}}.
@@ -35,7 +35,7 @@ Any23 Plugins
* How to Build a Plugin
- <Any23> takes care to <test> and <package> plugins when distributed from
its reactor <POM>.
+ <Apache Any23> takes care to <test> and <package> plugins when distributed
from its reactor <POM>.
It is aways possible to rebuild a plugin using the command:
+------------------------------------------
@@ -94,12 +94,12 @@ public class Version implements Tool {
}
public int run(String[] args) {
- final String version = Any23.VERSION;
+ final String version = Apache Any23.VERSION;
if(version == null) {
System.err.println("Error while retrieving configuration info.");
return 1;
}
- System.out.println(String.format("Any23 Core v. %s", version));
+ System.out.println(String.format("Apache Any23 Core v. %s", version));
System.out.println();
return 0;
}
Modified: incubator/any23/trunk/src/site/apt/configuration.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/configuration.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/configuration.apt (original)
+++ incubator/any23/trunk/src/site/apt/configuration.apt Fri Jan 13 14:46:37
2012
@@ -42,13 +42,13 @@ final String propertyValue = immutableCo
To obtain a <modifiable>
{{{./xref/org/apache/any23/configuration/Configuration.html} Configuration}}
instead it is possible to use the <<copy()>> method.\
- One of the <<Any23>> constructors accepts a <<Configuration>> object that
allows to customize the behavior
- of the <<Any23>> instance for its entire life-cycle.
+ One of the <<Apache Any23>> constructors accepts a <<Configuration>>
object that allows to customize the behavior
+ of the <<Apache Any23>> instance for its entire life-cycle.
+----------------------------------------------------------------------------------------------
final ModifiableConfiguration modifiableConf = DefaultConfiguration.copy();
final String oldPropertyValue = modifiableConf.setProperty("propertyName",
"new property value");
-final Any23 any23 = new Any23(modifiableConf, "extractor1", ...);
+final Apache Any23 any23 = new Apache Any23(modifiableConf, "extractor1", ...);
...
+----------------------------------------------------------------------------------------------
@@ -56,30 +56,30 @@ final Any23 any23 = new Any23(modifiable
It is possible to customize the behavior of a single data extraction by
providing an
{{{./xref/org/apache/any23/extractor/ExtractionParameters.html}
ExtractionParameters}}
- instance to one the <Any23#extract()> methods accepting it.
<<ExtractionParameters>> allows to customize any <property> and <flag>
+ instance to one the <Apache Any23#extract()> methods accepting it.
<<ExtractionParameters>> allows to customize any <property> and <flag>
other then the <<specific extraction options>>.\
If no custom parameters are specified the default configuration values are
used.
+----------------------------------------------------------------------------------------------
-final Any23 any23 = ...
+final Apache Any23 any23 = ...
final TripleHandler tripleHandler = ...
final ExtractionParameters extractionParameters =
ExtractionParameters.getDefault();
extractionParameters.setFlag("any23.microdata.strict", true);
any23.extract(extractionParameters, "http://path/to/doc", tripleHandler);
+----------------------------------------------------------------------------------------------
-* Any23 Core Module Default Configuration
+* Apache Any23 Core Module Default Configuration
*-------------------------------------------+-------------------------------+------------------------------------------------------------------------------------+
| Property Name | Default Property Value
|Description
|
*-------------------------------------------+-------------------------------+------------------------------------------------------------------------------------+
-| any23.core.version | <current any23 core version>
|String declaring the Any23 Core module version.
|
+| any23.core.version | <current any23 core version>
|String declaring the Apache Any23 Core module version.
|
*-------------------------------------------+-------------------------------+------------------------------------------------------------------------------------+
-|any23.http.user.agent.default |Any23-CLI
|User Agent Name used for HTTP requests.
|
+|any23.http.user.agent.default |Apache Any23-CLI
|User Agent Name used for HTTP requests.
|
*-------------------------------------------+-------------------------------+------------------------------------------------------------------------------------+
|any23.http.client.timeout |10000 (10 secs)
|Timeout in milliseconds for a HTTP request.
|
*-------------------------------------------+-------------------------------+------------------------------------------------------------------------------------+
-|any23.http.client.max.connections |5
|Max number of concurrent HTTP connections allowed by the internal Any23 HTTP
client.|
+|any23.http.client.max.connections |5
|Max number of concurrent HTTP connections allowed by the internal Apache Any23
HTTP client.|
*-------------------------------------------+-------------------------------+------------------------------------------------------------------------------------+
|any23.rdfa.extractor.xslt |rdfa.xslt
|XSLT Stylesheet to be used to perform HTML to RDF extraction of RDFa.
|
*-------------------------------------------+-------------------------------+------------------------------------------------------------------------------------+
@@ -93,7 +93,7 @@ any23.extract(extractionParameters, "htt
*-------------------------------------------+-------------------------------+------------------------------------------------------------------------------------+
|any23.extraction.context.uri |?(means current document URI)
|Default value for extraction content URI.
|
*-------------------------------------------+-------------------------------+------------------------------------------------------------------------------------+
-|any23.plugin.dirs |./plugins
|Directory containing Any23 plugins.
|
+|any23.plugin.dirs |./plugins
|Directory containing Apache Any23 plugins.
|
*-------------------------------------------+-------------------------------+------------------------------------------------------------------------------------+
|any23.microdata.strict |on (possible values: on/off)
|Activates/deactivates the microdata strict validation.
|
*-------------------------------------------+-------------------------------+------------------------------------------------------------------------------------+
Modified: incubator/any23/trunk/src/site/apt/dev-csv-extractor.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/dev-csv-extractor.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/dev-csv-extractor.apt (original)
+++ incubator/any23/trunk/src/site/apt/dev-csv-extractor.apt Fri Jan 13
14:46:37 2012
@@ -4,9 +4,9 @@ CSV Extractor Algorithm
an RDF representation of a CSV file compliant with the
{{{http://www.ietf.org/rfc/rfc4180.txt}RFC 4180}}
and that foresees an header.
Such extractor relies on the presence of an header to use the named fields
as RDF properties.
- Field delimiter could be automatically guessed or specified via
{{{./configuration.html}Any23 Configuration}}.
+ Field delimiter could be automatically guessed or specified via
{{{./configuration.html}Apache Any23 Configuration}}.
- Given a document with URL <url>, <<Any23>> uses the following algorithm to
extract RDF:
+ Given a document with URL <url>, <<Apache Any23>> uses the following
algorithm to extract RDF:
* It tries to guess the fields delimiter and to detect the header
Modified: incubator/any23/trunk/src/site/apt/dev-data-conversion.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/dev-data-conversion.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/dev-data-conversion.apt (original)
+++ incubator/any23/trunk/src/site/apt/dev-data-conversion.apt Fri Jan 13
14:46:37 2012
@@ -1,7 +1,7 @@
Data Conversion
+----------------------------------------------------------------------------------------------
-/*1*/ Any23 runner = new Any23();
+/*1*/ Apache Any23 runner = new Apache Any23();
/*2*/ final String content = "@prefix foo: <http://example.org/ns#> . " +
"@prefix : <http://other.example.org/ns#> ." +
"foo:bar foo: : . " +
@@ -18,10 +18,10 @@ Data Conversion
/*8*/ String nt = out.toString("UTF-8");
+----------------------------------------------------------------------------------------------
- This example aims to demonstrate how to use <<Any23>> to perform RDF data
conversion.
+ This example aims to demonstrate how to use <<Apache Any23>> to perform RDF
data conversion.
In this code we provide some input data expressed as <<Turtle>> and convert
it in <<NTriples>> format.
- At <<line 1>> we define a new instance of the <<Any23>> facade, that provides
all the methods
+ At <<line 1>> we define a new instance of the <<Apache Any23>> facade, that
provides all the methods
useful for the transformation. The facade constructor accepts a list of
extractor names, if specified
the extraction will be done only over this list, otherwise the data <MIME
Type> will detected and will be applied
all the compatible extractors declared within the
Modified: incubator/any23/trunk/src/site/apt/dev-data-extraction.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/dev-data-extraction.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/dev-data-extraction.apt (original)
+++ incubator/any23/trunk/src/site/apt/dev-data-extraction.apt Fri Jan 13
14:46:37 2012
@@ -1,7 +1,7 @@
Data Extraction
+----------------------------------------------------------------------------------------------
-/*1*/ Any23 runner = new Any23();
+/*1*/ Apache Any23 runner = new Apache Any23();
/*2*/ runner.setHTTPUserAgent("test-user-agent");
/*3*/ HTTPClient httpClient = runner.getHTTPClient();
/*4*/ DocumentSource source = new HTTPDocumentSource(
@@ -18,8 +18,8 @@ Data Extraction
/*9*/ String n3 = out.toString("UTF-8");
+----------------------------------------------------------------------------------------------
- This example demonstrates the data extraction, that is the main purpose of
<<Any23>> library.
- At <<line 1>> we define the <<Any23>> facade instance. As described before,
the constructor allows to enforce
+ This example demonstrates the data extraction, that is the main purpose of
<<Apache Any23>> library.
+ At <<line 1>> we define the <<Apache Any23>> facade instance. As described
before, the constructor allows to enforce
the usage of specific extractors.
The <<line 2>> defines the <HTTP User Agent>, used to identify the client
during <HTTP> data collection.
@@ -71,7 +71,7 @@ _:node14r93a8dex1 .
Filter Out Accidental Triples
- To remove accidental triples <<Any23>> provides a set of useful filters,
located
+ To remove accidental triples <<Apache Any23>> provides a set of useful
filters, located
within the <<org.deri.any23.filter>> package.
The filter
{{{./xref/org/apache/any23/filter/IgnoreTitlesOfEmptyDocuments.html}IgnoreTitlesOfEmptyDocuments}}
Modified: incubator/any23/trunk/src/site/apt/dev-microdata-extractor.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/dev-microdata-extractor.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/dev-microdata-extractor.apt (original)
+++ incubator/any23/trunk/src/site/apt/dev-microdata-extractor.apt Fri Jan 13
14:46:37 2012
@@ -10,6 +10,6 @@ Microdata Extractor
It is possible to retrieve the <<JSON>> representation of the same Microdata
as defined
at section
{{{http://www.w3.org/TR/microdata/#json}http://www.w3.org/TR/microdata/#json}}
by using the Microdata commandline tool,
- see {{{./getting-started.html#any23tools_script}Getting Started - Any23
Tools}}.
+ see {{{./getting-started.html#any23tools_script}Getting Started - Apache
Any23 Tools}}.
Modified: incubator/any23/trunk/src/site/apt/dev-microformat-extractors.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/dev-microformat-extractors.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/dev-microformat-extractors.apt (original)
+++ incubator/any23/trunk/src/site/apt/dev-microformat-extractors.apt Fri Jan
13 14:46:37 2012
@@ -2,16 +2,16 @@ Microformat Extractors
This section describes some extractions corner-cases and their relative RDF
representations.
Main aim of this section is to describe how
- some specific cases are processed with <<Any23>> showing the correspondences
between the extracted RDF triples.
+ some specific cases are processed with <<Apache Any23>> showing the
correspondences between the extracted RDF triples.
{microformat-nesting}
* Nesting different Microformats
[TODO: add picture about microformat nesting structure.]
- This section describes how <<Any23>> represents, with RDF, the content of an
HTML fragments containing different nested
+ This section describes how <<Apache Any23>> represents, with RDF, the content
of an HTML fragments containing different nested
Microformats.
- <<Any23>> performs the extraction executing different extractors for every
supported Microformat on a input HTML page.
+ <<Apache Any23>> performs the extraction executing different extractors for
every supported Microformat on a input HTML page.
There are two different possibilities to write extractors able to produce a
set of RDF triples that coherently
represents this nesting.
@@ -20,7 +20,7 @@ Microformat Extractors
* Embedding explicitly the logic within the
{{{./xref/org/apache/any23/extractor/html/package-summary.html}Microformats
Extractors}}
- * Using the default <<Any23>> nesting feature.
+ * Using the default <<Apache Any23>> nesting feature.
In the first case, the logic for representing the nested values, is directly
embedded in the upper-level Extractor.
For example, the following HTML fragment shows an hCard that contains an
hAddress Microformat.
@@ -96,7 +96,7 @@ public class HCardExtractor extends Enti
+----------------------------------------------------------------------------------------------
- Instead, the second manner is to leave to <<Any23>> the responsibility of
identifying nested Microformats and produce
+ Instead, the second manner is to leave to <<Apache Any23>> the responsibility
of identifying nested Microformats and produce
a set of descriptive RDF triples.
More specifically, the following HTML fragment, provided as a reference
example on
the
{{{http://www.google.com/support/webmasters/bin/answer.py?answer=146862}Google
Webmaster tools blog}},
@@ -117,7 +117,7 @@ public class HCardExtractor extends Enti
</p>
+----------------------------------------------------------------------------------------------
- Due to the fact that the <<Any23>> provided extractors don't explicitly
foresee the possibility of nesting such two
+ Due to the fact that the <<Apache Any23>> provided extractors don't
explicitly foresee the possibility of nesting such two
Microformats, it automatically identifies the nesting relationship and
represents it with the following triples:
+---------------------------------------------------------
Modified: incubator/any23/trunk/src/site/apt/dev-validation-fix.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/dev-validation-fix.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/dev-validation-fix.apt (original)
+++ incubator/any23/trunk/src/site/apt/dev-validation-fix.apt Fri Jan 13
14:46:37 2012
@@ -2,7 +2,7 @@ Validation and Fixing
Introduction
- <<Any23>> Is able to detect <<ill-formed HTML DOM content>> and apply fixes
over it.
+ <<Apache Any23>> Is able to detect <<ill-formed HTML DOM content>> and
apply fixes over it.
This section will show how to write RDFa validation Rule and Fix for RDFa.
@@ -10,21 +10,21 @@ Validation and Fixing
These errors may lead to a failures during RDF extraction process from HTML
pages but since they are, typically, syntax errors
they could be easily detected and fixed with some heuristics.
- This pages describes the <<Any23>> rule-based approach, that allows it to
detect, fix and correctly extract
+ This pages describes the <<Apache Any23>> rule-based approach, that allows
it to detect, fix and correctly extract
RDF from those ill-formed RDFa in XHTML pages.
- More specifically, <<Any23>> allows you to write a
{{{./xref/org/apache/any23/validator/Rule.html}Rule}}
+ More specifically, <<Apache Any23>> allows you to write a
{{{./xref/org/apache/any23/validator/Rule.html}Rule}}
able to detect the errors, a
{{{./xref/org/apache/any23/validator/Fix.html}Fix}} containing the logic to fix
the problem and a
{{{./xref/org/apache/any23/validator/Validator.html}Validator}} which acts
as a register of rules and fixes. The Validator
calls all the registered rules and when one of them is applied it calls the
associated Fix.
- The following code snipped shows how to programmatically detect and fix a
very common data error with <<Any23>>.
+ The following code snipped shows how to programmatically detect and fix a
very common data error with <<Apache Any23>>.
Fix Missing Prefix Mappings Declaration
Sometimes, web authors forget to declare prefix mappings. For example, you
can't just use something like dcterms:title
without first declaring the dcterms prefix mapping. If a prefix mapping
isn't declared, the RDFa parser won't understand
- the prefix when it is used in your document. This may lead <<Any23>> to
don't extract such embedded RDF triples.
+ the prefix when it is used in your document. This may lead <<Apache Any23>>
to don't extract such embedded RDF triples.
This:
@@ -42,7 +42,7 @@ Validation and Fixing
</div>
+------------------------------------------------------------------------------------------
- With the <<Any23>>
{{{./xref/org/apache/any23/validator/package-summary.html}Validator}} classes
it's possible to solve this
+ With the <<Apache Any23>>
{{{./xref/org/apache/any23/validator/package-summary.html}Validator}} classes
it's possible to solve this
problem simply implementing the
{{{./xref/org/apache/any23/validator/Rule.html}Rule}} interface as described
below:
+------------------------------------------------------------------------------------------
Modified: incubator/any23/trunk/src/site/apt/developers.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/developers.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/developers.apt (original)
+++ incubator/any23/trunk/src/site/apt/developers.apt Fri Jan 13 14:46:37 2012
@@ -1,32 +1,32 @@
Developers Guide
- This section introduces some <<Any23>> programming fundamentals.
+ This section introduces some <<Apache Any23>> programming fundamentals.
* {{{./dev-data-extraction.html}Data Extraction}}
- Explains how to extract RDF data from HTTP resources with <<Any23>>.
+ Explains how to extract RDF data from HTTP resources with <<Apache Any23>>.
* {{{./dev-data-conversion.html}Data Conversion}}
- Shows how to perform RDF data conversion with <<Any23>>.
+ Shows how to perform RDF data conversion with <<Apache Any23>>.
* {{{./dev-validation-fix.html}Validation and Fixing}}
- Demonstrates how to define validation and correction rules for HTML content
with <<Any23>>.
+ Demonstrates how to define validation and correction rules for HTML content
with <<Apache Any23>>.
* {{{./dev-xpath-extractor.html}XPath Extractor}}
- Explains how to write custom scraping rules for extracting RDF data from any
HTML content with <<Any23>>.
+ Explains how to write custom scraping rules for extracting RDF data from any
HTML content with <<Apache Any23>>.
* {{{./dev-microformat-extractors.html}Microformat Extractors}}
- Explains how to write new Microformat extractors with <<Any23>> and also
report interesting notes on
+ Explains how to write new Microformat extractors with <<Apache Any23>> and
also report interesting notes on
microformat nesting representation.
* {{{./dev-microdata-extractor.html}Microdata Extractor}}
- Explains how it works the Microdata Extractor embedded in <<Any23>>.
+ Explains how it works the Microdata Extractor embedded in <<Apache Any23>>.
* {{{./dev-csv-extractor.html}CSV Extractor}}
- Explains how it works the CSV Extractor embedded in <<Any23>>.
\ No newline at end of file
+ Explains how it works the CSV Extractor embedded in <<Apache Any23>>.
\ No newline at end of file
Modified: incubator/any23/trunk/src/site/apt/extractors.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/extractors.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/extractors.apt (original)
+++ incubator/any23/trunk/src/site/apt/extractors.apt Fri Jan 13 14:46:37 2012
@@ -1,6 +1,6 @@
-Any23 Extractors
+Apache Any23 Extractors
- This page enlists all the Any23 Extractors (see source code
{{{./xref/org/apache/any23/extractor/package-summary.html}package}}).
+ This page enlists all the Apache Any23 Extractors (see source code
{{{./xref/org/apache/any23/extractor/package-summary.html}package}}).
* Microformat Extractors
Modified: incubator/any23/trunk/src/site/apt/getting-started.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/getting-started.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/getting-started.apt (original)
+++ incubator/any23/trunk/src/site/apt/getting-started.apt Fri Jan 13 14:46:37
2012
@@ -1,14 +1,14 @@
-Getting started with <<Any23>>
+Getting started with <<Apache Any23>>
- <<Any23>> can be used:
+ <<Apache Any23>> can be used:
* via CLI (command line interface) from your preferred shell environment;
* as a RESTful Webservice;
* as a library.
-* <<Any23>> Modules
+* <<Apache Any23>> Modules
- <<Any23>> is composed of the following modules:
+ <<Apache Any23>> is composed of the following modules:
* <<<any23-core/>>> The core library.
@@ -16,11 +16,11 @@ Getting started with <<Any23>>
* <<<any23-plugins/>>> The core additional plugins.
-* Use the <<Any23>> CLI
+* Use the <<Apache Any23>> CLI
The command-line tools support is provided by the <<any23-core>> module.
- Once <<Any23>> has been correctly {{{./install.html}installed}}, if you
want to use it as a commandline tool,
+ Once <<Apache Any23>> has been correctly {{{./install.html}installed}}, if
you want to use it as a commandline tool,
use the shell scripts within the <<"any23-core/bin">> directory.
These are provided both for Unix (Linux/OSX).
@@ -38,9 +38,9 @@ Usage: ToolRunner <utility> [options...]
MicrodataParser Commandline Tool for extracting
Microdata from file/HTTP source.
MimeDetector
MIME Type Detector Tool.
PluginVerifier Utility for
plugin management verification.
- Rover
Any23 Command Line Tool.
+ Rover
Apache Any23 Command Line Tool.
Version Prints out the current library version
and configuration information.
- VocabPrinter Prints out the RDF Schema of
the vocabularies used by Any23.
+ VocabPrinter Prints out the RDF Schema of
the vocabularies used by Apache Any23.
+-------------------------------------------
The <any23tools> script detects a list of available utilities within the
<<any23-core>> and <<plugins>>
@@ -56,13 +56,13 @@ Usage: ToolRunner <utility> [options...]
* <<<MimeDetector>>>: detects the MIME Type for any HTTP / file /
direct input resource.
- * <<<PluginVerifier>>>: a utility for verifying <Any23> plugins.
+ * <<<PluginVerifier>>>: a utility for verifying <Apache Any23> plugins.
* <<<Rover>>>: the RDF extraction tool.
* <<<Version>>>: prints out useful information about the library
version and configuration.
- * <<<VocabPrinter>>>: allows to dump all the <<RDFSchema>> vocabularies
declared within Any23.
+ * <<<VocabPrinter>>>: allows to dump all the <<RDFSchema>> vocabularies
declared within Apache Any23.
** Rover
@@ -115,7 +115,7 @@ any23-core/bin$ ./any23tools Rover -f qu
Filtering trivial statements
- By default, <<Any23>> will extract <HTML/head> meta information, such as
links to <CSS stylesheets> or meta
+ By default, <<Apache Any23>> will extract <HTML/head> meta information,
such as links to <CSS stylesheets> or meta
information like the author or the software used to create the <html>.
Hence, if the user is only interested
in the structured content from the <HTML/body> tag we offer a filter
functionality, activated by the <<"-t">>
command line argument.
@@ -188,7 +188,7 @@ Usage: {http://path/to/resource.html|fil
** VocabPrinter
- The VocabPrinter Tool prints out the RDFSchema declared by all the <<Any23>>
+ The VocabPrinter Tool prints out the RDFSchema declared by all the <<Apache
Any23>>
declared vocabularies.
<<This tool is still in beta version.>>
@@ -214,9 +214,9 @@ text/n3
TODO: missing.
-* <<Any23>> CLI <Plugins>
+* <<Apache Any23>> CLI <Plugins>
- The <<Any23>> ToolRunner CLI (<bin/any23tools>) supports the auto detection
of Tool plugins within the classpath.
+ The <<Apache Any23>> ToolRunner CLI (<bin/any23tools>) supports the auto
detection of Tool plugins within the classpath.
For further details see {{{./any23-plugins.html}Plugins}} section.
The default <<any23>> CLI plugins are enlisted below.
@@ -272,10 +272,10 @@ usage: [{<url>|<file>}]+ [-d <arg>] [-e
any23-core/bin$ ./any23tools Crawler -s -f ntriples http://www.repubblica.it
1> out.nt 2> repubblica.log
+----------------------------------------------------------------------------
-* Use <<Any23>> as a RESTful Web Service
+* Use <<Apache Any23>> as a RESTful Web Service
- <<Any23>> provides a Web Service that can be used to extract <RDF> from Web
documents.
- <<Any23>> services can be accessed through a {{{./service.html}RESTful
API}}.
+ <<Apache Any23>> provides a Web Service that can be used to extract <RDF>
from Web documents.
+ <<Apache Any23>> services can be accessed through a
{{{./service.html}RESTful API}}.
Running the server
@@ -289,9 +289,9 @@ any23-service/bin$ ./any23server
from the command line in order to start up the server, then go to
{{{http://localhost:8080/}}}
to access the web interface. A live demo version of such service is
running at {{{http://any23.org/}}}.
You can also start the server from Java by running the
- {{{./xref/org/apache/any23/servlet/Servlet.html}Any23 Servlet}} class.
Maven can be used to create a WAR
+ {{{./xref/org/apache/any23/servlet/Servlet.html}Apache Any23 Servlet}}
class. Maven can be used to create a WAR
file for deployment into an existing servlet container such as
{{{http://tomcat.apache.org/}Apache Tomcat}}.
-* Use <<Any23>> as a Library
+* Use <<Apache Any23>> as a Library
See our {{{./developers.html}Developers guide}} for more details.
Modified: incubator/any23/trunk/src/site/apt/index.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/index.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/index.apt (original)
+++ incubator/any23/trunk/src/site/apt/index.apt Fri Jan 13 14:46:37 2012
@@ -1,4 +1,4 @@
-Introduction to Any23
+Introduction to Apache Any23
* Library
@@ -17,8 +17,8 @@ Introduction to Any23
A detailed description of available extractors is
{{{./extractors.html}here}}.
- <<Any23>> is used in major Web of Data applications such as
{{{http://sindice.com/}sindice.com}} and {{{http://sig.ma/}sig.ma}}. It is
written in Java and licensed under the
{{{http://any23.googlecode.com/svn/trunk/LICENSE.txt}Apache License}}.
- <<Any23>> can be used in various ways:
+ <<Apache Any23>> is used in major Web of Data applications such as
{{{http://sindice.com/}sindice.com}} and {{{http://sig.ma/}sig.ma}}. It is
written in Java and licensed under the
{{{http://any23.googlecode.com/svn/trunk/LICENSE.txt}Apache License}}.
+ <<Apache Any23>> can be used in various ways:
* As a library in Java applications that consume structured data from the
Web.
* As a command-line tool for extracting and converting between the
supported formats.
* As online service API available at {{{http://any23.org/}any23.org}}.
@@ -29,19 +29,19 @@ Introduction to Any23
{{{./index.html}Introduction}}: this page.
- {{{./install.html}Install}}: how to install <<Any23>> library and service.
+ {{{./install.html}Install}}: how to install <<Apache Any23>> library and
service.
- {{{./getting-started.html} Getting Started}}: start using <<Any23>>
command-line tools.
+ {{{./getting-started.html} Getting Started}}: start using <<Apache Any23>>
command-line tools.
- {{{./supported-formats.html}Supported Formats}}: complete list of
<<Semantic Web>> formats supported by <<Any23>>.
+ {{{./supported-formats.html}Supported Formats}}: complete list of
<<Semantic Web>> formats supported by <<Apache Any23>>.
{{{./configuration.html}Configuration}}: learn how to change default
library and service configuration.
- {{{./service.html}REST Service}}: discover how to use the <<Any23 REST
Service>>.
+ {{{./service.html}REST Service}}: discover how to use the <<Apache Any23
REST Service>>.
- {{{./any23-plugins.html}Plugins}}: read how to install and configure the
<<Any23>> plugins.
+ {{{./any23-plugins.html}Plugins}}: read how to install and configure the
<<Apache Any23>> plugins.
- {{{./developers.html}Developers}}: understand the <<Any23>> code internals,
+ {{{./developers.html}Developers}}: understand the <<Apache Any23>> code
internals,
how to write plugins, fixing rules and customize the code.
* Community
Modified: incubator/any23/trunk/src/site/apt/install.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/install.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/install.apt (original)
+++ incubator/any23/trunk/src/site/apt/install.apt Fri Jan 13 14:46:37 2012
@@ -1,6 +1,6 @@
-Any23 Installation Guide
+Apache Any23 Installation Guide
- This page describes how to install <<Any23>>.
+ This page describes how to install <<Apache Any23>>.
* Download a Stable Distribution
@@ -17,7 +17,7 @@ Any23 Installation Guide
$ svn checkout http://any23.googlecode.com/svn/trunk any23-trunk-readonly
+---------------------------------------------------------------------------
-* Build <<Any23>>
+* Build <<Apache Any23>>
The following instructions describe how to build the library with
{{{http://maven.apache.org/}Maven 2.x.y+}}.
For specific information about Maven see: {{{http://maven.apache.org/}}}
@@ -33,7 +33,7 @@ $ cd trunk/
trunk$ mvn clean install
+-----------
- This will install the <<Any23>> artifact and its dependencies in your
local M2 repository.
+ This will install the <<Apache Any23>> artifact and its dependencies in
your local M2 repository.
* Generate Documentation
Modified: incubator/any23/trunk/src/site/apt/service.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/service.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/service.apt (original)
+++ incubator/any23/trunk/src/site/apt/service.apt Fri Jan 13 14:46:37 2012
@@ -1,6 +1,6 @@
-Any23 REST Service
+Apache Any23 REST Service
- <Any23> provides REST Service module <any23-service> able to provide
useful processing methods.
+ <Apache Any23> provides REST Service module <any23-service> able to
provide useful processing methods.
* Compact API
@@ -103,7 +103,7 @@ Code Reason
{report-format}
- The <Any23 Service> can optionally return an XML report and attempt error
fix if
+ The <Apache Any23 Service> can optionally return an XML report and attempt
error fix if
the flags <fix> and <report> are activated ( <fix=on&report=on> ).
The following URL shows how to use these flags.
Modified: incubator/any23/trunk/src/site/apt/supported-formats.apt
URL:
http://svn.apache.org/viewvc/incubator/any23/trunk/src/site/apt/supported-formats.apt?rev=1231099&r1=1231098&r2=1231099&view=diff
==============================================================================
--- incubator/any23/trunk/src/site/apt/supported-formats.apt (original)
+++ incubator/any23/trunk/src/site/apt/supported-formats.apt Fri Jan 13
14:46:37 2012
@@ -1,48 +1,48 @@
-Supported Formats in Any23
+Supported Formats in Apache Any23
- <<Any23>> supports all the main standard formats introduced by the
<<Semantic Web>> community.
+ <<Apache Any23>> supports all the main standard formats introduced by the
<<Semantic Web>> community.
* <<Input Formats>>
The following list shows the accepted input formats and for each one the
support level.
- * <<(X)HTML>> with <<RDFa 1.0>>, <<RDFa 1.1>>, <<Microdata>> and
<<Microformats>>. <<Any23>> fully supports the
+ * <<(X)HTML>> with <<RDFa 1.0>>, <<RDFa 1.1>>, <<Microdata>> and
<<Microformats>>. <<Apache Any23>> fully supports the
{{{http://www.w3.org/TR/html5/}(X)HTML5}} input format and in particular
provides a set of extractors for processing embedded
{{{http://www.w3.org/TR/rdfa-syntax/}RDFa 1.0}},
{{{http://www.w3.org/TR/rdfa-core/}RDFa 1.1}},
{{{http://microformats.org/}Microformats}}
and {{{http://www.w3.org/TR/microdata/}Microdata}}.
- * <<Turtle>> <<Any23>> fully supports the
{{{http://www.w3.org/TeamSubmission/turtle/}Turtle}} specification.
+ * <<Turtle>> <<Apache Any23>> fully supports the
{{{http://www.w3.org/TeamSubmission/turtle/}Turtle}} specification.
- * <<N-Triples>> <<Any23>> fully supports the
{{{http://www.w3.org/TR/rdf-testcases/#ntriples}N-Triples}} specification.
+ * <<N-Triples>> <<Apache Any23>> fully supports the
{{{http://www.w3.org/TR/rdf-testcases/#ntriples}N-Triples}} specification.
- * <<N-Quads>> <<Any23>> fully supports the
{{{http://sw.deri.org/2008/07/n-quads/}N-Quads}} specification.
+ * <<N-Quads>> <<Apache Any23>> fully supports the
{{{http://sw.deri.org/2008/07/n-quads/}N-Quads}} specification.
- * <<RDF/XML>> <<Any23>> fully supports the
{{{http://www.w3.org/TR/rdf-syntax-grammar/}RDF/XML}} specification.
+ * <<RDF/XML>> <<Apache Any23>> fully supports the
{{{http://www.w3.org/TR/rdf-syntax-grammar/}RDF/XML}} specification.
- * <<CSV>> <<Any23>> allows you to represent header-provided
{{{http://www.ietf.org/rfc/rfc4180.txt}CSV}} files with RDF using a specific
{{{./dev-csv-extractor.html}algorithm}}.
+ * <<CSV>> <<Apache Any23>> allows you to represent header-provided
{{{http://www.ietf.org/rfc/rfc4180.txt}CSV}} files with RDF using a specific
{{{./dev-csv-extractor.html}algorithm}}.
* <<Output Formats>>
The supported output formats are enlisted below.
- * <<Turtle>> <<Any23>> is able to produce output in
{{{http://www.w3.org/TeamSubmission/turtle/}Turtle}}.
+ * <<Turtle>> <<Apache Any23>> is able to produce output in
{{{http://www.w3.org/TeamSubmission/turtle/}Turtle}}.
- * <<N-Triples>> <<Any23>> is able to produce output in
{{{http://www.w3.org/TR/rdf-testcases/#ntriples}N-Triples}}.
+ * <<N-Triples>> <<Apache Any23>> is able to produce output in
{{{http://www.w3.org/TR/rdf-testcases/#ntriples}N-Triples}}.
- * <<N-Quads>> <<Any23>> is able to produce output in
{{{http://sw.deri.org/2008/07/n-quads/}N-Quads}}.
+ * <<N-Quads>> <<Apache Any23>> is able to produce output in
{{{http://sw.deri.org/2008/07/n-quads/}N-Quads}}.
- * <<RDF/XML>> <<Any23>> is able to produce output in
{{{http://www.w3.org/TR/rdf-syntax-grammar/}RDF/XML}}.
+ * <<RDF/XML>> <<Apache Any23>> is able to produce output in
{{{http://www.w3.org/TR/rdf-syntax-grammar/}RDF/XML}}.
- * <<JSON Statements>> <<Any23>> is able to produce output in
{{{http://www.json.org/}JSON}} . See the specific {{{json-statements}format}}.
+ * <<JSON Statements>> <<Apache Any23>> is able to produce output in
{{{http://www.json.org/}JSON}} . See the specific {{{json-statements}format}}.
- * <<XML Report>> <<Any23>> is able to produce a detailed report of the
latest document extraction if required. See further details
{{{./service.html#report-format}here}}.
+ * <<XML Report>> <<Apache Any23>> is able to produce a detailed report of
the latest document extraction if required. See further details
{{{./service.html#report-format}here}}.
* JSON Statements Format
{json-statements}
- Any23 is able to produce JSON output following the format described below.
+ Apache Any23 is able to produce JSON output following the format
described below.
Given the following example statements (expressed in N-Quads format):