Hi there, Cory and I have been discussing about how to work on the database, based on the previous discussions on the list. Before start working on the time consuming part, we'd like to see what you guys think.
This could be the procedure to "clean" a set of games to be included: 1.- Add Source tag (e.g. [Source "Pitt Chess Archive - Events"] or [Source "The Week In Chess - 211"] ). 2.- Import the games to a Scid DB. 3.- Use the "Cleaner" feature to do automatic spelling check, add eco, add elo and delete exact twin games.* 4.- Resolve ambiguous names manually, when it can be done easily. 5.- Manually delete remaining twin games.** * "Exact twin games" refers to games found checking all the options in the Delete twin games window. ** To find this twins the default options are used. Doing some search and using the Internet Archive one can get rid of some ambiguous names, but it is time consuming. So what is the importance of ambiguous names? should we try to get rid of them all no matter the time consumed? should we delete the games with ambiguous names? or should we do a reasonable effort to get rid of them and leave it there? What settings should we use on the step 5? what do you guys use? So, what do you think? Cheers, Israel ------------------------------------------------------------------------- This SF.Net email is sponsored by the Moblin Your Move Developer's challenge Build the coolest Linux based applications with Moblin SDK & win great prizes Grand prize is a trip for two to an Open Source event anywhere in the world http://moblin-contest.org/redirect.php?banner_id=100&url=/ _______________________________________________ Scid-users mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/scid-users
