Hi there,

Cory and I have been discussing about how to work on the database,  
based on the previous discussions on the list. Before start working on  
the time consuming part, we'd like to see what you guys think.

This could be the procedure to "clean" a set of games to be included:

1.- Add Source tag (e.g. [Source "Pitt Chess Archive - Events"] or  
[Source "The Week In Chess - 211"] ).
2.- Import the games to a Scid DB.
3.- Use the "Cleaner" feature to do automatic spelling check, add eco,  
add elo and delete exact twin games.*
4.- Resolve ambiguous names manually, when it can be done easily.
5.- Manually delete remaining twin games.**

* "Exact twin games" refers to games found checking all the options in  
the Delete twin games window.

** To find this twins the default options are used.

Doing some search and using the Internet Archive one can get rid of  
some ambiguous names, but it is time consuming. So what is the  
importance of ambiguous names? should we try to get rid of them all no  
matter the time consumed? should we delete the games with ambiguous  
names? or should we do a reasonable effort to get rid of them and  
leave it there?

What settings should we use on the step 5? what do you guys use?

So, what do you think?

Cheers,
Israel

-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
_______________________________________________
Scid-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scid-users

Reply via email to