I had a look at generating these using a little Ruby crawling and Mahout
[1][2]. Naturally this has taken a great deal more time than comparing each
page manually. Results are included at the end of this email, and could easily
be rejigged into an .htaccess file. The form is:

<old uri> ->
   ...0 or more...
   <candidate apache uri> (<similarity score>)
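
For anyone wanting to do that rejigging, here is a rough sketch (not the
redirect-miner code itself) of reading the listing in the form above and
emitting mod_alias Redirect lines. The function names and the 0.5 score
cut-off are my own assumptions; it accepts a score either on the same line as
its candidate or wrapped onto the next line.

```python
import re
from urllib.parse import urlparse

# An old URI is followed by "->"; a candidate URI is followed by "(score)".
# \s matches newlines too, so a score wrapped onto the next line still pairs up.
ENTRY = re.compile(
    r"(?P<old>https?://\S+)\s*->"
    r"|(?P<cand>https?://\S+)\s+\((?P<score>[0-9.]+)\)"
)

def parse_results(text):
    """Map each old URI to its list of (candidate URI, score) pairs."""
    results = {}
    current = None
    for m in ENTRY.finditer(text):
        if m.group("old"):
            current = m.group("old")
            results[current] = []
        elif current is not None:
            results[current].append((m.group("cand"), float(m.group("score"))))
    return results

def to_htaccess(results, threshold=0.5):
    """Emit one Redirect line per old URI whose best candidate is good enough.

    The threshold is an arbitrary assumption, not a value from the results.
    """
    lines = []
    for old, candidates in results.items():
        if not candidates:
            continue  # no candidate found: needs a manual decision
        best, score = max(candidates, key=lambda c: c[1])
        if score >= threshold:
            lines.append(f"Redirect permanent {urlparse(old).path} {best}")
    return lines
```

Old URIs with no candidates, or only low-scoring ones, are left out on purpose
so they can be checked by hand.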

The more astute reader will notice that I could have just compared the end of
the URIs :-)
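
A minimal sketch of that shortcut, assuming "the end of the URI" means the
last path segment with case, punctuation and a trailing .html ignored. The
helper names are hypothetical and this is not what was actually run.

```python
import re
from urllib.parse import urlparse

def slug(uri):
    """Last path segment, lower-cased, without .html or punctuation."""
    path = urlparse(uri).path.rstrip("/")
    last = path.rsplit("/", 1)[-1]
    last = re.sub(r"\.html?$", "", last)
    return re.sub(r"[^a-z0-9]", "", last.lower())

def match_by_suffix(old_uris, new_uris):
    """Map each old URI to every new URI that shares its slug."""
    by_slug = {}
    for new in new_uris:
        by_slug.setdefault(slug(new), []).append(new)
    return {old: by_slug.get(slug(old), []) for old in old_uris}
```

This pairs TDB/JVM-64-32 with tdb/jvm_64_32.html, but it cannot tell
SDB/Configuration from TDB/Configuration without also looking at the parent
segment, and it finds nothing for Fuseki, whose best candidate in the results
is serving_data/.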

Damian

[1] <http://mahout.apache.org/>
[2] <https://bitbucket.org/shellac/redirect-miner/overview>

http://openjena.org/wiki/Main_Page ->
http://openjena.org/wiki/SDB ->
http://openjena.org/wiki/TDB ->
http://openjena.org/wiki/SPARQL_Extensions ->
http://openjena.org/wiki/ARQ ->
http://openjena.org/wiki/JenaMulgara ->
http://openjena.org/wiki/SSE ->
    http://incubator.apache.org/jena/documentation/notes/sse.html (0.940895209383557)
    http://incubator.apache.org/jena/documentation/query/algebra.html (0.251858984739002)
    http://incubator.apache.org/jena/documentation/query/arq-query-eval.html (0.247283196667233)
http://openjena.org/wiki/Fuseki ->
    http://incubator.apache.org/jena/documentation/serving_data/ (0.865087874887867)
    http://incubator.apache.org/jena/documentation/serving_data/index.html (0.865087874887866)
    http://incubator.apache.org/jena/documentation/serving_data/soh.html (0.372591848747247)
    http://incubator.apache.org/jena/documentation/tdb/assembler.html (0.337512617542261)
http://openjena.org/wiki/RIOT ->
    http://incubator.apache.org/jena/documentation/io/riot.html (0.819405587343479)
http://openjena.org/wiki/SOH ->
    http://incubator.apache.org/jena/documentation/serving_data/soh.html (0.802637364456163)
http://openjena.org/wiki/SDB/Quickstart ->
    http://incubator.apache.org/jena/documentation/sdb/quickstart.html (0.708293359964489)
http://openjena.org/wiki/SDB/Installation ->
    http://incubator.apache.org/jena/documentation/sdb/installation.html (0.643619808497736)
http://openjena.org/wiki/SDB/Commands ->
    http://incubator.apache.org/jena/documentation/sdb/commands.html (0.876295810427175)
http://openjena.org/wiki/SDB/Store_Description ->
    http://incubator.apache.org/jena/documentation/sdb/store_description.html (0.817560977427525)
http://openjena.org/wiki/SDB/Dataset_Description ->
    http://incubator.apache.org/jena/documentation/sdb/dataset_description.html (0.75380772694568)
http://openjena.org/wiki/SDB/Configuration ->
    http://incubator.apache.org/jena/documentation/sdb/configuration.html (0.851370601516982)
    http://incubator.apache.org/jena/documentation/tdb/configuration.html (0.379339281677876)
http://openjena.org/wiki/SDB/JavaAPI ->
    http://incubator.apache.org/jena/documentation/sdb/javaapi.html (0.900273404120858)
    http://incubator.apache.org/jena/documentation/sdb/store_description.html (0.335630109753941)
http://openjena.org/wiki/SDB/Database_Layouts ->
    http://incubator.apache.org/jena/documentation/sdb/database_layouts.html (0.745030654801725)
http://openjena.org/wiki/SDB/Joseki_Integration ->
    http://incubator.apache.org/jena/documentation/sdb/joseki_integration.html (0.766100908909383)
http://openjena.org/wiki/SDB/FAQ ->
    http://incubator.apache.org/jena/documentation/sdb/faq.html (0.627237320546457)
http://openjena.org/wiki/SDB/Databases_Supported ->
    http://incubator.apache.org/jena/documentation/sdb/databases_supported.html (0.527626049308696)
http://openjena.org/wiki/SDB/Support ->
http://openjena.org/wiki/SDB/Loading_performance ->
    http://incubator.apache.org/jena/documentation/sdb/loading_performance.html (0.861511376797829)
http://openjena.org/wiki/SDB/Loading_data ->
    http://incubator.apache.org/jena/documentation/sdb/loading_data.html (0.90451825247527)
http://openjena.org/wiki/SDB/Query ->
http://openjena.org/wiki/SDB/Query_performance ->
    http://incubator.apache.org/jena/documentation/sdb/query_performance.html (0.83331910761654)
http://openjena.org/wiki/SDB/NotesPostgreSQL ->
http://openjena.org/wiki/SDB/NotesMySQL ->
http://openjena.org/wiki/SDB/NotesDerby ->
http://openjena.org/wiki/SDB/NotesMSSQL ->
http://openjena.org/wiki/SDB/NotesDB2 ->
http://openjena.org/wiki/TDB/Requirements ->
    http://incubator.apache.org/jena/documentation/tdb/requirements.html (0.680413570081196)
http://openjena.org/wiki/TDB/JVM-64-32 ->
    http://incubator.apache.org/jena/documentation/tdb/jvm_64_32.html (0.732310846150155)
http://openjena.org/wiki/TDB/JavaAPI ->
    http://incubator.apache.org/jena/documentation/tdb/java_api.html (0.76568185957999)
http://openjena.org/wiki/TDB/Installation ->
http://openjena.org/wiki/TDB/Commands ->
    http://incubator.apache.org/jena/documentation/tdb/commands.html (0.72947603230751)
http://openjena.org/wiki/TDB/Datasets ->
    http://incubator.apache.org/jena/documentation/tdb/datasets.html (0.761534103091479)
http://openjena.org/wiki/TDB/QuadFilter ->
    http://incubator.apache.org/jena/documentation/tdb/quadfilter.html (0.831796876191203)
http://openjena.org/wiki/TDB/ValueCanonicalization ->
    http://incubator.apache.org/jena/documentation/tdb/value_canonicalization.html (0.761273159113877)
http://openjena.org/wiki/TDB/DynamicDatasets ->
    http://incubator.apache.org/jena/documentation/tdb/dynamic_datasets.html (0.708953988076054)
http://openjena.org/wiki/TDB/Assembler ->
    http://incubator.apache.org/jena/documentation/tdb/assembler.html (0.889383733892633)
http://openjena.org/wiki/TDB/Optimizer ->
    http://incubator.apache.org/jena/documentation/tdb/optimizer.html (0.941873402152256)
http://openjena.org/wiki/TDB/Architecture ->
    http://incubator.apache.org/jena/documentation/tdb/architecture.html (0.894742673843093)
    http://incubator.apache.org/jena/documentation/tdb/jvm_64_32.html (0.437922833349563)
http://openjena.org/wiki/TDB/Configuration ->
    http://incubator.apache.org/jena/documentation/tdb/configuration.html (0.807278273651406)
    http://incubator.apache.org/jena/documentation/sdb/configuration.html (0.377209580591843)
http://openjena.org/wiki/TDB/Joseki_Integration ->
http://openjena.org/wiki/ARQ/Explain ->
http://openjena.org/wiki/ARQ/Manipulating_SPARQL_using_ARQ ->
    http://incubator.apache.org/jena/documentation/query/arq-query-eval.html (0.246126793420272)
http://openjena.org/wiki/ARQ/Logging ->
http://openjena.org/wiki/TDB/Concurrency ->
