Re: statistical similarity of two text
On Tue, 17 Jul 2001, Cantor wrote: Does anybody know where I can find program on the website which [can] compare two texts/articles and settle whether or not they are similar assuming any significant level. Sorry, Cantor: this is not possible, in general. One can discover whether two (or more) things _differ_ (on some quantitative measure) at a specified significance level (when this is a reasonable thing to do -- it isn't always reasonable), but the formal definition of significant in statistical analysis does not permit discovering whether two (or more) things are _similar_. However, it may suffice for your purposes to discover that two things are not different enough that you can tell them apart (which is not the same thing as discovering that they are the same), on whatever measure (or set of measures) you choose to analyze. Whether this be a useful outcome or not depends heavily on how much information you have (that is, on the size of the sample available) on the things being compared. In any case, the hard part is defining the characteristics, or properties, or measures, on which the two texts/articles are to be compared. Donald F. Burrill [EMAIL PROTECTED] 184 Nashua Road, Bedford, NH 03110 603-471-7128 = Instructions for joining and leaving this list and remarks about the problem of INAPPROPRIATE MESSAGES are available at http://jse.stat.ncsu.edu/ =
Re: EdStat: Probabilistic inference in resampling?
In article [EMAIL PROTECTED], Robert J. MacG. Dawson [EMAIL PROTECTED] wrote: Herman Rubin wrote: I consider the Fisherian one to be the only relevant one. In fact, I do not think it goes far enough; at best, probability is a property of the real world like length and mass. On the contrary: length and mass are abstractions that approximately describe certain aspects of the real world (quantum mechanics demonstrates this.) They can also both be used, advan- tageously, to describe other aspects of the real world other than those that we normally associate with them (for instance, dimensional analysis makes good use of the fact that the natural dimension of capacitance is length...) I should have said that probability should be an assumed property of the real world like length and mass. Abstract models are mental; they are not derived from the real world, but rather selected if they fit. Einstein (I do not have the reference) was one of the few to see that this is the case. Probability works like that too. Mathematically probability is very simple and paradox-free; it exists only in the sense that 24-dimensional quaternionic space or the Monster Group exist. Various real-world things behave in roughly the same way: both long-run frequencies (the frequentist application of probability theory to the real world) and degrees of belief subject to certain mental disciplines that individuals may or may not adhere to (the Bayesian application). The frequentist application depends heavily on strong prior assumptions. The assumption of independent trials with the same probability is effectively untestable; at best, it is approximately true, but we act as if this approximation holds very far out in the tails. The frequentist view of testing is even more questionable. (It is undoubtedly true that in the real world a person may well believe incompatible things. The word believe is not the3 property of the statistical community and we cannot say oh no you don't to such a person. However, those whose degrees of belief in related events do *not* fit the Bayesian yoga and who are prepared to bet when the odds give them an edge according to their own beliefs can be reliably (in the long run) relieved of their stakes. This gives some objective basis to the idea of consistent belief systems.) -Robert Dawson = Instructions for joining and leaving this list and remarks about the problem of INAPPROPRIATE MESSAGES are available at http://jse.stat.ncsu.edu/ = -- This address is for information only. I do not claim that these views are those of the Statistics Department or of Purdue University. Herman Rubin, Dept. of Statistics, Purdue Univ., West Lafayette IN47907-1399 [EMAIL PROTECTED] Phone: (765)494-6054 FAX: (765)494-0558 = Instructions for joining and leaving this list and remarks about the problem of INAPPROPRIATE MESSAGES are available at http://jse.stat.ncsu.edu/ =
Re: Regression to the mean,Barry Bonds HRs
On 17 Jul 2001 15:23:29 -0700, [EMAIL PROTECTED] (dennis roberts) wrote: At 04:08 PM 7/17/01 -0400, Rich Ulrich wrote: But, so far as I have heard, the league MEANS stay the same. The SDs are the same. There is no preference, that I have ever heard, for records to be set by half-season, early or late, team or individual. My guess is that association between talent and winning (or hitting, or pitching, etc.) remains the same. dmr but, individuals are not half tests ... and, the mean # of homers in the second half is not the same (except by coincidence) to the # of home runs in the second half ... the original post was not about TEAM stats ... which are fixed in the sense that if one team loses alot ... another team wins alot ... I am not at all convinced of a difference. Sporting statistics still seem to me to be exactly the same as parallel tests in psychology. Team stats, I agree, might seem MORE or LESS parallel - more parallel, since the means have to be the same. But UP for one has to be DOWN for another. Exactly like homers, the IQ score in May is not exactly the same as January, except by coincidence. If you were scanning a group, you would expect RTTM in either instance, exactly as described by the correlation, etc. dmr every individual player could get better the 2nd half of the season ... or, get worse ... in terms of batting ... [ ... ] To be sure: I was referring to performance *in* a 2nd half, and not as statistics that are cumulative. - History says, a big change is less likely for baseball than for a psych-test. If you take your parallel IQ test several months apart, I suppose you could sprain-a-brain in the meantime. Or drop out of school and miss the test entirely. You can read in the sports page that a team has caught fire or is in the slumps. However, so far as I have seen, the longest streaks are amazingly consistent with chance. I have seen estimates, for the major US professional sports. The estimates for individual streaks are not beyond chance, either. (Some of these were in articles published in the ASA journal named Chance, over the last several years.) -- Rich Ulrich, [EMAIL PROTECTED] http://www.pitt.edu/~wpilib/index.html = Instructions for joining and leaving this list and remarks about the problem of INAPPROPRIATE MESSAGES are available at http://jse.stat.ncsu.edu/ =
Symposium 2001 of Statistics Canada
Statistics Canada is holding its 18th International Symposium on Methodological Issues from October 16 to October 19, 2001, in Hull, Quebec, just minutes away from Ottawa. The theme of Symposium 2001 is Achieving Data Quality in a Statistical Agency: a Methodological Perspective. This symposium will provide an ideal forum for statisticians, researchers, academics, data analysts and persons interested in meeting the challenges of a statistical agency to discuss, exchange knowledge and share experiences. Three workshops will be held on October 16th. They are Non-sampling Errors by Clyde Tucker of the Bureau of Labor Statistics, USA; An Overview of Statistical Quality Control Methods Used in Statistical Agencies by Walter Mudryk of Statistics Canada; and Implementing Continuous Quality Improvement in a Statistical Agency by David Marker and David Morganstein of WESTAT, USA. Visit www.statcan.ca/english/conferences/symposium2001 where you will find the Symposium 2001 Program and the Registration Form. You can also contact us by email at [EMAIL PROTECTED] Register now! = Instructions for joining and leaving this list and remarks about the problem of INAPPROPRIATE MESSAGES are available at http://jse.stat.ncsu.edu/ =
Ë÷ÎÄ·Ò빫˾-¶àÓïÖÖרҵ·Òë»ú¹¹£¬±ÊÒë¡¢¿ÚÒë¡¢ÍøÕ¾±¾µØ»¯
ÈôҪȡÏû£¬Çë»Ø¸´: [EMAIL PROTECTED] Ë÷ÎÄ·Ò빫˾-¶àÓïÖÖרҵ·Òë»ú¹¹£¬±ÊÒë¡¢¿ÚÒë¡¢ÍøÕ¾±¾µØ»¯ ¿ÉÌṩ10¶à¸öÓïÖֵĿڡ¢±ÊÒë·þÎñ£¬Í¬Ê±´ÓÊ¿Ƽ¼¡¢¾Ã³Í¼Êé¡¢µç×Ó³ö°æÎïµÄ·Òë³ö°æ¡¢»¥ÁªÍøÍøÕ¾¿ª·¢µÈÏà¹Ø·þÎñ¡£ 1.1 Ë÷ÎÄ·ÒëÓÐÏÞ¹«Ë¾ÊǾ±±¾©º£µíÁôѧÈËÔ±´´ÒµÔ°Åú×¼£¬±±¾©Êй¤ÉÌÐÐÕþ¹ÜÀí¾Ö×¢²áµÄרҵ·Ò빫˾£¬ÓÉÁôѧ¹é¹úÈËÔ±´´°ì£¬¿ÉÌṩ10¶à¸öÓïÖֵĿڡ¢±ÊÒë·þÎñ£¬Í¬Ê±´ÓÊ¿Ƽ¼¡¢¾Ã³Í¼Êé¡¢µç×Ó³ö°æÎïµÄ·Òë³ö°æ¡¢»¥ÁªÍøÍøÕ¾¿ª·¢µÈÏà¹Ø·þÎñ¡£SovenTrans ÇãÐÄÇãÁ¦½ß³ÏΪº£ÄÚÍâ¿Í»§ÌṩÓÅÖÊ·þÎñ¡£ 1.2 ·Òë¼ÛÄ¿±í Ó¢ÒëÖÐ 90£220Ôª ÖÐÒëÓ¢ 150£240Ôª Èյ·¨º«ÒâµÈÓïÖÖ£º200£280Ôª×óÓÒ(ÿǧ×Ö) 1.3ºÏ×÷ÒâÏò£º Ë÷ÎÄ·Ò빫˾ͨ¹ý»¥ÁªÍø»ã¾ÛÁËÊÀ½ç¸÷µØÊýÍòÃû¾ßÓв»Í¬ÐÐÒµ±³¾°ºÍרҵ֪ʶµÄÓïÑÔ¾«Ó¢£¬²¢ÓëÈ«Çò30¶à¼Ò×éÖ¯»ú¹¹½¨Á¢ÁËÒµÎñºÏ×÷¹Øϵ£¬ÖÂÁ¦ÓÚÌṩ¸÷ÖÖÐÎʽµÄÓïÑÔרҵ·þÎñ£¬·þÎñº¸ÇÁËÈ«Çò¶àÊýÓïÖÖ¡£ ÏÖ³ÏÖ¿Ï£ÍûÊÀ½ç¸÷µØµÄ ·Ò빫˾¡¢·ÒëлᡢÂÃÓι«Ë¾¡¢´óרԺУ¡¢³ö°æÉè¼Æ¹«Ë¾¡¢ÂÉʦÊÂÎñËù¡¢½ø³ö¿Ú¹«Ë¾µÈ»ú¹¹ÓëÎÒÃÇÁªÏµ£¬¹²Í¬Ì½ÌÖÔÚÎļþ±ÊÒë¡¢ÉÌÎñ¼°»áÒé¿ÚÒë¡¢ÍøÕ¾±¾µØ»¯¡¢ÓïÑÔÅàѵ¡¢¸÷ÀàÎÄÊé׫д¡¢ÂÃÓηþÎñ ¡¢³ö°æÉè¼ÆµÈÁìÓòÄËÖÁ¸ü¶à¸ü¹ãÁìÓòµÄºÏ×÷¡£ Çë¸øÎÒÃÇ·¢µç×ÓÓʼþ£¬ÎÒÃǽ«¼°Ê±ÓëÄúÁªÏµ¡£¸øÎÒÃÇÒ»¸ö»ú»á£¬¸øÄúÒ»¸ö»ú»á£¬Ï£ÍûÕâÄܳÉΪÎÒÃdz¤ÆÚºÏ×÷¡¢×ßÏò¹²Í¬»Ô»ÍµÄÔ¿³×¡£ ÁªÏµÒµÎñ£¬Çë»Ø¸´£º[EMAIL PROTECTED] ¼ÙÈç¸øÄú´øÀ´²»±ã£¬Çë»Ø¸´: [EMAIL PROTECTED] ÎÒÃÇ°ïÄúɾ³ý µ¥Î»: ±±¾©Ë÷ÎÄ·ÒëÓÐÏÞ¹«Ë¾ µç»°: 0086-10-62988488 Óʱà: 100085 µØÖ·: Öйú±±¾©Éϵض«7Çø3Â¥4-502ÊÒ ´«Õæ: 0086-10-62978488 ˽µç: 13611234157 ÍøÖ·: www.soven.com ÐÕÃû: Å˾°»ª ºô»ú: È«¹úÁªÍø 198-10033237 ÓÊÏä: [EMAIL PROTECTED] ÖйúÒøÐÐÉϵطÖÀí´¦ Õʺţº06909700517001 1.1 Beijing Soven Translation Co., Ltd., a specialized translation organization approved The Iicubator of Beijing Haidian Science Park by Beijing science Technology Commission and registered with Shanghai Industrial Commercial Administration Bureau, sincerely provides quality service for customers both at home and abroad. SovenTrans boasts over two hundred professional translators and interpreters covering more than ten languages and has established a broad translation network in China. 1.2 Soven in search of Excellence, Trust us, guaranteed content, logic, completeness, accuracy and consistency in matching your original documents to your intended audience, Overcome Chinglish with Quality Translation, this is SovenTrans. Welcome to browse our Web-site, http://www.soven.com , or E-mail to us. 1.3 Quotation for translation Non-technical (RMB/thousand Chinese characters) Technical (RMB/thousand Chinese characters) Contract (RMB/thousand Chinese characters) Chinese¡úEnglish or Japanese 150-240 180-260 200-240 English or Japanese¡úChinese 90-180 95-220 160-220 Chinese¡úGerman or French 220 260 300 German or French ¡úChinese 180 220 260 Chinese¡úother languages 320 360 400 Other languages¡ú Chinese 240 280 320 1. 4 Seeking Partners: Beijing Soven Translation Co., Ltd. is a global company with branches in America, Asia, Europe and Australia, successfully managing a network of over 10,000 linguistic experts from all over the world who work closely via the Internet, and providing multilingual linguistic services. We have established cooperation relations with more than 30 institutions and organizations worldwide. We are sincerely seeking partners like translation companies, translation institutes, tourism companies, colleges, publishing and designing companies, law offices as well as import export companies to cooperate in the fields of document translation, business and conference interpretation, website localization, language training, tourist guide, publishing and designing, etc. Please contact us. 1.5 we can send emails for your company at the method of group-sending emails, RMB Yuan 100/ 100 thousand email 1.6 Address: Room 502,Entrance 4, Building 3, Zone No. 7, Shangdi Dongli, Haidian District, Beijing, China Post code: 100085 Tel: +86 10-62988488,62978488,13611234157 Fax: +86 10-62978488 BP:95838-50031 Website: http://www.soven.com E-mail: [EMAIL PROTECTED] , [EMAIL PROTECTED] 1.1 Soven Translation Êǽ±±¾©Êй¤ÉÌÐÐÕþ¹ÜÀí¾ÖעԵģIÐÔ·×gC£¬CTCAÐÄAÁ¦½ßÕ\麣ÈÍâ¿ÍôÌṩÙ|·þÕ¡£È˲Åúú£¬ÖÁ½ñÒÑ R¼¯Á˶þ°ÙðNÃû£I¿Ú¡¢¹P×gÈËT£¬·×gÕZ·NÉæ¼°µÊ®·N£¬KÔÚÖÐøÈA|µØ ^½¨Á¢ÆðÁËÒ»´óµÄ·×g¾W½j¡£ÒÔÙ|Á¿ÇóÉú´æ£¬ÒÔÐÅ×ué±£Ô^¡£ÔÚÐÐIÈÂÊÏÈñÓÃISO9000Ë×¼ßMÐÐÙ|Á¿¼°Á÷³Ì¹ÜÀí£¬×î´ó³Ì¶ÈµØÌá¸ßÁË·×gÙ|Á¿ºÍ·×gЧÂÊ¡£ 1.2 ·×grÄ¿±í (RMB Yuan) ÖÐ×gÓ¢¡¢ÈÕ 160Ôª/ǧ×Ö Ó¢¡¢ÈÕ×gÖÐ 90-120Ôª/ǧ×Ö ÆäËûÕZ·N»¥×g240-300Ôª/ǧ×Ö 1.3 ºÏ×÷ÒâÏò£º CTC·×gÖÐÐÄͨß^»¥Â¾W R¾ÛÁËÊÀ½ç¸÷µØµÈfÃû¾ßÓв»Í¬ÐÐI±³¾°ºÍ£IÖª×RµÄÕZÑÔ¾«Ó¢£¬KÅcÈ«Çò30¶à¼Ò½M¿C½¨Á¢ÁËIÕºÏ×÷êPϵ£¬ÖÂÁ¦ì¶Ìṩ¸÷·NÐÎʽµÄÕZÑÔ£I·þÕ£¬·þÕºÉwÁËÈ«Çò¶àµÕZ·N¡£ ¬FÕ\´Ï£ÍûÊÀ½ç¸÷µØµÄ ·×g¹«Ë¾¡¢·×g fþ¡¢ÂÃß[¹«Ë¾¡¢´ó£ÔºÐ£¡¢³ö°æÔOÓ¹«Ë¾¡¢ÂÉÊÂÕËù¡¢ßM³ö¿Ú¹«Ë¾µÈCÅcÎÒÂϵ£¬¹²Í¬Ì½ÓÔÚÎļþ¹P×g¡¢ÉÌÕ¼°þ×h¿Ú×g¡¢¾WÕ¾±¾µØ»¯¡¢ÕZÑÔÅàÓ¡¢¸÷îÎÄø׫¡¢ÂÃß[·þÕ ¡¢³ö°æÔOÓµÈîIÓòÄËÖÁ¸ü¶à¸üVîIÓòµÄºÏ×÷¡£ 1.4