I had a look at generating these using a little ruby crawling and mahout. [1][2] Naturally this has taken a great deal more time than comparing each page manually. Results are included at the end of this email, and could easily be rejigged into an htaccess file. The form is:
<old uri> -> ...0 or more... <candidate apache uri> (<similarity score>) The more astute reader will notice that I could have just compared the end of the uris :-) Damian [1] <http://mahout.apache.org/> [2] <https://bitbucket.org/shellac/redirect-miner/overview> http://openjena.org/wiki/Main_Page -> http://openjena.org/wiki/SDB -> http://openjena.org/wiki/TDB -> http://openjena.org/wiki/SPARQL_Extensions -> http://openjena.org/wiki/ARQ -> http://openjena.org/wiki/JenaMulgara -> http://openjena.org/wiki/SSE -> http://incubator.apache.org/jena/documentation/notes/sse.html (0.940895209383557) http://incubator.apache.org/jena/documentation/query/algebra.html (0.251858984739002) http://incubator.apache.org/jena/documentation/query/arq-query-eval.html (0.247283196667233) http://openjena.org/wiki/Fuseki -> http://incubator.apache.org/jena/documentation/serving_data/ (0.865087874887867) http://incubator.apache.org/jena/documentation/serving_data/index.html (0.865087874887866) http://incubator.apache.org/jena/documentation/serving_data/soh.html (0.372591848747247) http://incubator.apache.org/jena/documentation/tdb/assembler.html (0.337512617542261) http://openjena.org/wiki/RIOT -> http://incubator.apache.org/jena/documentation/io/riot.html (0.819405587343479) http://openjena.org/wiki/SOH -> http://incubator.apache.org/jena/documentation/serving_data/soh.html (0.802637364456163) http://openjena.org/wiki/SDB/Quickstart -> http://incubator.apache.org/jena/documentation/sdb/quickstart.html (0.708293359964489) http://openjena.org/wiki/SDB/Installation -> http://incubator.apache.org/jena/documentation/sdb/installation.html (0.643619808497736) http://openjena.org/wiki/SDB/Commands -> http://incubator.apache.org/jena/documentation/sdb/commands.html (0.876295810427175) http://openjena.org/wiki/SDB/Store_Description -> http://incubator.apache.org/jena/documentation/sdb/store_description.html (0.817560977427525) http://openjena.org/wiki/SDB/Dataset_Description -> http://incubator.apache.org/jena/documentation/sdb/dataset_description.html (0.75380772694568) http://openjena.org/wiki/SDB/Configuration -> http://incubator.apache.org/jena/documentation/sdb/configuration.html (0.851370601516982) http://incubator.apache.org/jena/documentation/tdb/configuration.html (0.379339281677876) http://openjena.org/wiki/SDB/JavaAPI -> http://incubator.apache.org/jena/documentation/sdb/javaapi.html (0.900273404120858) http://incubator.apache.org/jena/documentation/sdb/store_description.html (0.335630109753941) http://openjena.org/wiki/SDB/Database_Layouts -> http://incubator.apache.org/jena/documentation/sdb/database_layouts.html (0.745030654801725) http://openjena.org/wiki/SDB/Joseki_Integration -> http://incubator.apache.org/jena/documentation/sdb/joseki_integration.html (0.766100908909383) http://openjena.org/wiki/SDB/FAQ -> http://incubator.apache.org/jena/documentation/sdb/faq.html (0.627237320546457) http://openjena.org/wiki/SDB/Databases_Supported -> http://incubator.apache.org/jena/documentation/sdb/databases_supported.html (0.527626049308696) http://openjena.org/wiki/SDB/Support -> http://openjena.org/wiki/SDB/Loading_performance -> http://incubator.apache.org/jena/documentation/sdb/loading_performance.html (0.861511376797829) http://openjena.org/wiki/SDB/Loading_data -> http://incubator.apache.org/jena/documentation/sdb/loading_data.html (0.90451825247527) http://openjena.org/wiki/SDB/Query -> http://openjena.org/wiki/SDB/Query_performance -> http://incubator.apache.org/jena/documentation/sdb/query_performance.html (0.83331910761654) http://openjena.org/wiki/SDB/NotesPostgreSQL -> http://openjena.org/wiki/SDB/NotesMySQL -> http://openjena.org/wiki/SDB/NotesDerby -> http://openjena.org/wiki/SDB/NotesMSSQL -> http://openjena.org/wiki/SDB/NotesDB2 -> http://openjena.org/wiki/TDB/Requirements -> http://incubator.apache.org/jena/documentation/tdb/requirements.html (0.680413570081196) http://openjena.org/wiki/TDB/JVM-64-32 -> http://incubator.apache.org/jena/documentation/tdb/jvm_64_32.html (0.732310846150155) http://openjena.org/wiki/TDB/JavaAPI -> http://incubator.apache.org/jena/documentation/tdb/java_api.html (0.76568185957999) http://openjena.org/wiki/TDB/Installation -> http://openjena.org/wiki/TDB/Commands -> http://incubator.apache.org/jena/documentation/tdb/commands.html (0.72947603230751) http://openjena.org/wiki/TDB/Datasets -> http://incubator.apache.org/jena/documentation/tdb/datasets.html (0.761534103091479) http://openjena.org/wiki/TDB/QuadFilter -> http://incubator.apache.org/jena/documentation/tdb/quadfilter.html (0.831796876191203) http://openjena.org/wiki/TDB/ValueCanonicalization -> http://incubator.apache.org/jena/documentation/tdb/value_canonicalization.html (0.761273159113877) http://openjena.org/wiki/TDB/DynamicDatasets -> http://incubator.apache.org/jena/documentation/tdb/dynamic_datasets.html (0.708953988076054) http://openjena.org/wiki/TDB/Assembler -> http://incubator.apache.org/jena/documentation/tdb/assembler.html (0.889383733892633) http://openjena.org/wiki/TDB/Optimizer -> http://incubator.apache.org/jena/documentation/tdb/optimizer.html (0.941873402152256) http://openjena.org/wiki/TDB/Architecture -> http://incubator.apache.org/jena/documentation/tdb/architecture.html (0.894742673843093) http://incubator.apache.org/jena/documentation/tdb/jvm_64_32.html (0.437922833349563) http://openjena.org/wiki/TDB/Configuration -> http://incubator.apache.org/jena/documentation/tdb/configuration.html (0.807278273651406) http://incubator.apache.org/jena/documentation/sdb/configuration.html (0.377209580591843) http://openjena.org/wiki/TDB/Joseki_Integration -> http://openjena.org/wiki/ARQ/Explain -> http://openjena.org/wiki/ARQ/Manipulating_SPARQL_using_ARQ -> http://incubator.apache.org/jena/documentation/query/arq-query-eval.html (0.246126793420272) http://openjena.org/wiki/ARQ/Logging -> http://openjena.org/wiki/TDB/Concurrency ->
signature.asc
Description: Message signed with OpenPGP using GPGMail
