Hi All, I need help so badly , I hope you would understand my situation
The use case is, I have one folder which has multiple XML files and I need to write a PIG script which recursively parse all the files and generate one flat file. The XML looks like this and each XML file has different clinical_study_rank such as *clinical_study rank="687"* *<?xml version="1.0" encoding="UTF-8"?>* *<clinical_study rank="687">* * <!-- This xml conforms to an XML Schema at:* * http://clinicaltrials.gov/ct2/html/images/info/public.xsd <http://clinicaltrials.gov/ct2/html/images/info/public.xsd>* * and an XML DTD at:* * http://clinicaltrials.gov/ct2/html/images/info/public.dtd <http://clinicaltrials.gov/ct2/html/images/info/public.dtd> -->* * <required_header>* * <download_date>ClinicalTrials.gov processed this data on November 07, 2013</download_date>* * <link_text>Link to the current ClinicalTrials.gov record.</link_text>* * <url>http://clinicaltrials.gov/show/NCT00000611 <http://clinicaltrials.gov/show/NCT00000611></url>* * </required_header>* * <id_info>* * <org_study_id>114</org_study_id>* * <nct_id>NCT00000611</nct_id>* * </id_info>* * <brief_title>Women's Health Initiative (WHI)</brief_title>* * <sponsors>* * <lead_sponsor>* * <agency>National Heart, Lung, and Blood Institute (NHLBI)</agency>* * <agency_class>NIH</agency_class>* * </lead_sponsor>* * <collaborator>* * <agency>National Institute of Arthritis and Musculoskeletal and Skin Diseases (NIAMS)</agency>* * <agency_class>NIH</agency_class>* * </collaborator>* * <collaborator>* * <agency>National Cancer Institute (NCI)</agency>* * <agency_class>NIH</agency_class>* * </collaborator>* * <collaborator>* * <agency>National Institute on Aging (NIA)</agency>* * <agency_class>NIH</agency_class>* * </collaborator>* * </sponsors>* * <source>National Heart, Lung, and Blood Institute (NHLBI)</source>* * <oversight_info>* * <authority>United States: Federal Government</authority>* * </oversight_info>* * <brief_summary>* * <textblock>* * To address cardiovascular disease, cancer, and osteoporosis, the most common causes of* * death, disability, and impaired quality of life in postmenopausal women. The three major* * components of the WHI are: a randomized controlled clinical trial of hormone replacement* * therapy (HRT), dietary modification (DM), and calcium/vitamin D supplementation (CaD); an* * observational study (OS); and a community prevention study (CPS). On October 1, 1997,* * administration of the WHI was transferred to the NHLBI where it is conducted as a consortium* * effort led by the NHLBI in cooperation with the National Institute of Arthritis and* * Musculoskeletal and Skin Diseases (NIAMS), the National Cancer Institute (NCI), and the* * National Institute on Aging (NIA).* * </textblock>* * </brief_summary>* * <detailed_description>* * <textblock>* * BACKGROUND:* * Prior to 1991, little research had focused on health issues unique to, or more common for,* * women. This was especially the case for studies of chronic diseases and their prevention in* * mature women. These conditions (coronary heart disease, cancer, and osteoporosis) are the* * leading causes of impairment of quality of life, morbidity, and mortality in post-menopausal* * United States women. The WHI, mandated by Congress, was established in 1991 by the National* * Institutes of Health and located in the Office of the Director (OD). The Clinical* * Coordinating Center for the clinical trial/observational study was funded in September 1992* * and the 16 Vanguard Clinical Centers were funded in March 1993. The initial protocol was* * developed jointly by the Clinical Coordinating Center and the Program Office and was* * reviewed and approved by the Investigators Committee on April 20, 1993. Additional clinical* * centers were funded in 1994.* * On October 1, 1997, administration of the WHI was transferred to the NHLBI where it is* * conducted as a consortium effort led by the NHLBI in cooperation with the National Institute* * of Arthritis and Musculoskeletal and Skin Diseases (NIAMS), the National Cancer Institute* * (NCI), and the National Institute on Aging (NIA).* * DESIGN NARRATIVE:* * As has been described in the objective, the WHI has three major components: a randomized* * controlled clinical trial, an observational study, and a study of community approaches to* * developing healthful behaviors. Recruitment for the WHI began in September 1993 and ended* * in December 1998. Six clinical centers completed recruitment in January 1997. The remaining* * 34 centers completed recruitment in December 1998.* * CLINICAL TRIAL COMPONENT* * The clinical trial component consists of three subtrials: the hormone replacement trial, the* * dietary modification trial, and the calcium /vitamin D supplementation trial. Approximately* * 27,500 women aged 50 to 79 are participating in the HRT, which tests whether long-term HRT* * reduces coronary heart disease and fractures without increasing breast cancer risk. Women* * with a uterus were randomized to receive either estrogen plus progestin or a placebo.* * Progestin was added to protect women with a uterus from endometrial cancer. Women who have* * had a hysterectomy were randomized to receive either estrogen alone or a placebo. The* * estrogen plus progestin trial was stopped early on July 8, 2002 after an average follow-up* * of 5.2 years on the recommendation of the Data and Safety Monitoring Board. The estrogen* * alone study continued unchanged until March 2, 2004 when the NIH instructed participants to* * stop taking their study pills and to begin the follow-up phase of the study. . Participants* * in the estrogen alone study will be followed for eight to 12 years and have clinic visits* * every six months to assure safety and assess their health.* * The dietary modification trial component studies the effect of a low-fat, high fruit,* * vegetable and grain diet on breast cancer, colorectal cancer and heart disease in 48,000* * postmenopausal women. Participants are randomized to a comparison group which maintains* * usual dietary habits or to a dietary change group. Women in the dietary change group* * decrease their fat intake to 20 percent of total daily calories, increase fruit and* * vegetable consumption to five or more servings per day, and increase grains to six or more* * servings per day. Additionally, they monitor their food intake and attend nutrition group* * meetings to learn more about changing their diets in the first year. Thereafter they attend* * four meetings per year.* * The calcium/vitamin D supplementation subtrial tests whether calcium and vitamin D* * supplements reduce the risk of hip and other fractures and colorectal cancer in* * postmenopausal women. Women in the hormone replacement therapy and the dietary modification* * trials are encouraged to join the calcium/vitamin D supplementation study. Approximately* * 45,000 postmenopausal women are randomized into one of two study groups. One group was* * randomly assigned to receive 1,000 mg of elemental calcium (as calcium carbonate) and 400* * International Units of vitamin Dâdaily. The second group received a matching placebo.* * Women already taking calcium supplements can continue to take them. Participants will be* * followed for eight to 11 years and contacted by their clinical center every six months to* * assure their safety and assess their health.* * Total number of trial participants in all three subtrials is 68,135.* * OBSERVATIONAL STUDY* * The several goals of the study include: to give reliable estimates of extent to which known* * risk factors predict heart disease, cancers, and fractures; to identify new risk factors for* * these and other diseases in women; to compare risk factors, presence of disease at the start* * of the study and new occurrences of disease during the WHI in all study components; and to* * create a future resource to identify biological indicators of disease, especially substances* * and factors found in blood. The study enrolled 93,726 postmenopausal women and will track* * them for an average of nine years. Participants fill out periodic health forms and visit* * the clinic three years after enrollment. They take no medication and do not change their* * health habits.* * COMMUNITY PREVENTION STUDY* * The community prevention study consists of 12 separate studies conducted at eight of the* * Centers for Disease Control and Prevention's (CDC) University-based Prevention Research* * Centers through a cooperative agreement between NIH and CDC. The 12 studies began in* * October 1995 and continue for an additional five years. The collaboration supports health* * promotion and disease prevention research and demonstration projects that are* * community-based and focus on healthy behaviors that prevent the major causes of death and* * disability and that promote health practices that lead to more effective public health* * interventions. Each project provides research dissemination and translation of findings* * into community interventions. Topics under study include: attitudes towards hysterectomy,* * oophorectomy, and surgical menopause among African Americans; reducing cardiovascular* * disease risk among Black women; environmental and policy interventions to increase physical* * activity among minority women ages 40 to 75; peer support intervention for cardiovascular* * disease risk among African American women, aged 40 and older; assessing the effectiveness of* * a brief medical-provider educational intervention for osteoporosis in minority women aged 40* * and older; improving the delivery of diabetes care to women in minority groups; and* * assessment of moderate physical activity among women.* * </textblock>* * </detailed_description>* * <overall_status>Completed</overall_status>* * <phase>Phase 3</phase>* * <study_type>Interventional</study_type>* * <study_design>Allocation: Randomized, Primary Purpose: Prevention</study_design>* * <condition>Bone Diseases</condition>* * <condition>Breast Neoplasms</condition>* * <condition>Cardiovascular Diseases</condition>* * <condition>Colonic Neoplasms</condition>* * <condition>Coronary Disease</condition>* * <condition>Heart Diseases</condition>* * <condition>Myocardial Ischemia</condition>* * <condition>Osteoporosis</condition>* * <condition>Postmenopause</condition>* * <intervention>* * <intervention_type>Drug</intervention_type>* * <intervention_name>hormone replacement therapy</intervention_name>* * </intervention>* * <intervention>* * <intervention_type>Drug</intervention_type>* * <intervention_name>estrogens</intervention_name>* * </intervention>* * <intervention>* * <intervention_type>Drug</intervention_type>* * <intervention_name>progestins</intervention_name>* * </intervention>* * <intervention>* * <intervention_type>Drug</intervention_type>* * <intervention_name>estrogen replacement therapy</intervention_name>* * </intervention>* * <intervention>* * <intervention_type>Behavioral</intervention_type>* * <intervention_name>diet, fat-restricted</intervention_name>* * </intervention>* * <intervention>* * <intervention_type>Drug</intervention_type>* * <intervention_name>calcium</intervention_name>* * </intervention>* * <intervention>* * <intervention_type>Drug</intervention_type>* * <intervention_name>vitamin D</intervention_name>* * </intervention>* * <intervention>* * <intervention_type>Behavioral</intervention_type>* * <intervention_name>dietary supplements</intervention_name>* * </intervention>* * <eligibility>* * <criteria>* * <textblock>* * Postmenopausal women ages 50 to 79.* * </textblock>* * </criteria>* * <gender>Female</gender>* * <minimum_age>50 Years</minimum_age>* * <maximum_age>79 Years</maximum_age>* * <healthy_volunteers>No</healthy_volunteers>* * </eligibility>* * <overall_official>* * <last_name>Ross Prentice</last_name>* * <affiliation>Fred Hutchinson Cancer Research Center</affiliation>* * </overall_official>* * <link>* * <url>http://www.nhlbi.nih.gov/whi/ <http://www.nhlbi.nih.gov/whi/></url>* * </link>* * <reference>* * <citation>Rossouw JE, Hurd S. The Women's Health Initiative: recruitment complete--looking back and looking forward. J Womens Health. 1999 Jan-Feb;8(1):3-5. No abstract available.</citation>* * <PMID>10094073</PMID>* * </reference>* * <reference>* * <citation>[No authors listed] Design of the Women's Health Initiative clinical trial and observational study. The Women's Health Initiative Study Group. Control Clin Trials. 1998 Feb;19(1):61-109.</citation>* * <PMID>9492970</PMID>* * </reference>* * <reference>* * <citation>Patterson RE, Kristal AR, Tinker LF, Carter RA, Bolton MP, Agurs-Collins T. Measurement characteristics of the Women's Health Initiative food frequency questionnaire. Ann Epidemiol. 1999 Apr;9(3):178-87.</citation>* * <PMID>10192650</PMID>* * </reference>* * <reference>* * <citation>McGowan JA, Pottern L. Commentary on the Women's Health Initiative. Maturitas. 2000 Feb 15;34(2):109-12.</citation>* * <PMID>10714904</PMID>* * </reference>* * <reference>* * <citation>Wassertheil-Smoller S, Anderson G, Psaty BM, Black HR, Manson J, Wong N, Francis J, Grimm R, Kotchen T, Langer R, Lasser N. Hypertension and its treatment in postmenopausal women: baseline data from the Women's Health Initiative. Hypertension. 2000 Nov;36(5):780-9.</citation>* * <PMID>11082143</PMID>* * </reference>* * <reference>* * <citation>Wilcox S, Shumaker SA, Bowen DJ, Naughton MJ, Rosal MC, Ludlam SE, Dugan E, Hunt JR, Stevens S. Promoting adherence and retention to clinical trials in special populations: a women's health initiative workshop. Control Clin Trials. 2001 Jun;22(3):279-89.</citation>* * <PMID>11384790</PMID>* * </reference>* * <reference>* * <citation>Gottesman R. Medical care for minors without parental consent. Child Today. 1975 Mar-Apr;4(2):30-1. No abstract available.</citation>* * <PMID>1116413</PMID>* * </reference>* * <reference>* * <citation>Hsia J, Kemper E, Sofaer S, Bowen D, Kiefe CI, Zapka J, Mason E, Lillington L, Limacher M. Is insurance a more important determinant of healthcare access than perceived health? Evidence from the Women's Health Initiative. J Womens Health Gend Based Med. 2000 Oct;9(8):881-9.</citation>* * <PMID>11074954</PMID>* * </reference>* * <reference>* * <citation>Valanis BG, Bowen DJ, Bassford T, Whitlock E, Charney P, Carter RA. Sexual orientation and health: comparisons in the women's health initiative sample. Arch Fam Med. 2000 Sep-Oct;9(9):843-53.</citation>* * <PMID>11031391</PMID>* * </reference>* * <reference>* * <citation>Hsia J, Kemper E, Kiefe C, Zapka J, Sofaer S, Pettinger M, Bowen D, Limacher M, Lillington L, Mason E. The importance of health insurance as a determinant of cancer screening: evidence from the Women's Health Initiative. Prev Med. 2000 Sep;31(3):261-70.</citation>* * <PMID>10964640</PMID>* * </reference>* * <reference>* * <citation>Larkey LK, Staten LK, Ritenbaugh C, Hall RA, Buller DB, Bassford T, Altimari BR. Recruitment of Hispanic women to the Women's Health Initiative. the case of Embajadoras in Arizona. Control Clin Trials. 2002 Jun;23(3):289-98.</citation>* * <PMID>12057880</PMID>* * </reference>* * <reference>* * <citation>Tinker LF, Perri MG, Patterson RE, Bowen DJ, McIntosh M, Parker LM, Sevick MA, Wodarski LA. The effects of physical and emotional status on adherence to a low-fat dietary pattern in the Women's Health Initiative. J Am Diet Assoc. 2002 Jun;102(6):789-800, 888.</citation>* * <PMID>12067044</PMID>* * </reference>* * <reference>* * <citation>Hendrix SL, Clark A, Nygaard I, Aragaki A, Barnabei V, McTiernan A. Pelvic organ prolapse in the Women's Health Initiative: gravity and gravidity. Am J Obstet Gynecol. 2002 Jun;186(6):1160-6.</citation>* * <PMID>12066091</PMID>* * </reference>* * <reference>* * <citation>[No authors listed] Risks and benefits of estrogen plus progestin in healthy postmenopausal women: principal results From the Women's Health Initiative randomized controlled trial. JAMA. 2002 Jul 17;288(3):321-33.</citation>* * <PMID>12117397</PMID>* * </reference>* * <reference>* * <citation>Fletcher SW, Colditz GA. Failure of estrogen plus progestin therapy for prevention. JAMA. 2002 Jul 17;288(3):366-8. No abstract available.</citation>* * <PMID>12117403</PMID>* * </reference>* * <reference>* * <citation>Pradhan AD, Manson JE, Rossouw JE, Siscovick DS, Mouton CP, Rifai N, Wallace RB, Jackson RD, Pettinger MB, Ridker PM. Inflammatory biomarkers, hormone replacement therapy, and incident coronary heart disease: prospective analysis from the Women's Health Initiative observational study. JAMA. 2002 Aug 28;288(8):980-7.</citation>* * <PMID>12190368</PMID>* * </reference>* * <reference>* * <citation>Manson JE, Greenland P, LaCroix AZ, Stefanick ML, Mouton CP, Oberman A, Perri MG, Sheps DS, Pettinger MB, Siscovick DS. Walking compared with vigorous exercise for the prevention of cardiovascular events in women. N Engl J Med. 2002 Sep 5;347(10):716-25.</citation>* * <PMID>12213942</PMID>* * </reference>* * <reference>* * <citation>Shikany JM, Patterson RE, Agurs-Collins T, Anderson G. Antioxidant supplement use in Women's Health Initiative participants. Prev Med. 2003 Mar;36(3):379-87.</citation>* * <PMID>12634029</PMID>* * </reference>* * <reference>* * <citation>Hays J, Ockene JK, Brunner RL, Kotchen JM, Manson JE, Patterson RE, Aragaki AK, Shumaker SA, Brzyski RG, LaCroix AZ, Granek IA, Valanis BG; Women's Health Initiative Investigators. Effects of estrogen plus progestin on health-related quality of life. N Engl J Med. 2003 May 8;348(19):1839-54. Epub 2003 Mar 17.</citation>* * <PMID>12642637</PMID>* * </reference>* * <reference>* * <citation>Rapp SR, Espeland MA, Shumaker SA, Henderson VW, Brunner RL, Manson JE, Gass ML, Stefanick ML, Lane DS, Hays J, Johnson KC, Coker LH, Dailey M, Bowen D; WHIMS Investigators. Effect of estrogen plus progestin on global cognitive function in postmenopausal women: the Women's Health Initiative Memory Study: a randomized controlled trial. JAMA. 2003 May 28;289(20):2663-72.</citation>* * <PMID>12771113</PMID>* * </reference>* * <reference>* * <citation>Shumaker SA, Legault C, Thal L, Wallace RB, Ockene JK, Hendrix SL, Jones BN 3rd, Assaf AR, Jackson RD, Kotchen JM, Wassertheil-Smoller S, Wactawski-Wende J; WHIMS Investigators. Estrogen plus progestin and the incidence of dementia and mild cognitive impairment in postmenopausal women: the Women's Health Initiative Memory Study: a randomized controlled trial. JAMA. 2003 May 28;289(20):2651-62.</citation>* * <PMID>12771112</PMID>* * </reference>* * <reference>* * <citation>Wassertheil-Smoller S, Hendrix SL, Limacher M, Heiss G, Kooperberg C, Baird A, Kotchen T, Curb JD, Black H, Rossouw JE, Aragaki A, Safford M, Stein E, Laowattana S, Mysiw WJ; WHI Investigators. Effect of estrogen plus progestin on stroke in postmenopausal women: the Women's Health Initiative: a randomized trial. JAMA. 2003 May 28;289(20):2673-84.</citation>* * <PMID>12771114</PMID>* * </reference>* * <reference>* * <citation>Chlebowski RT, Hendrix SL, Langer RD, Stefanick ML, Gass M, Lane D, Rodabough RJ, Gilligan MA, Cyr MG, Thomson CA, Khandekar J, Petrovitch H, McTiernan A. Influence of Estrogen Plus Progestin on Breast Cancer and Mammography in Healthy Postmenopausal Women: The Women's Health Initiative Randomized Trial. JAMA. 2003 Jun 25;289(24):3243-53.</citation>* * <PMID>12824205</PMID>* * </reference>* * <reference>* * <citation>Gann PH, Morrow M. Combined hormone therapy and breast cancer: a single-edged sword. JAMA. 2003 Jun 25;289(24):3304-6. No abstract available.</citation>* * <PMID>12824214</PMID>* * </reference>* * <reference>* * <citation>Jackson M, Berman N, Huber M, Snetselaar L, Granek I, Boe K, Milas C, Spivak J, Chlebowski RT. Research staff turnover and participant adherence in the Women's Health Initiative. Control Clin Trials. 2003 Aug;24(4):422-35.</citation>* * <PMID>12865036</PMID>* * </reference>* * <reference>* * <citation>Howard BV, Criqui MH, Curb JD, Rodabough R, Safford MM, Santoro N, Wilson AC, Wylie-Rosett J. Risk factor clustering in the insulin resistance syndrome and its relationship to cardiovascular disease in postmenopausal white, black, hispanic, and Asian/Pacific Islander women. Metabolism. 2003 Mar;52(3):362-71.</citation>* * <PMID>12647277</PMID>* * </reference>* * <reference>* * <citation>Hsia J, Barad D, Margolis K, Rodabough R, McGovern PG, Limacher MC, Oberman A, Smoller S; Women's Health Initiative Research Group. Usefulness of prior hysterectomy as an independent predictor of Framingham risk score (The Women's Health Initiative). Am J Cardiol. 2003 Aug 1;92(3):264-9.</citation>* * <PMID>12888128</PMID>* * </reference>* * <reference>* * <citation>LaCroix AZ, Cauley JA, Pettinger M, Hsia J, Bauer DC, McGowan J, Chen Z, Lewis CE, McNeeley SG, Passaro MD, Jackson RD. Statin use, clinical fracture, and bone density in postmenopausal women: results from the Women's Health Initiative Observational Study. Ann Intern Med. 2003 Jul 15;139(2):97-104.</citation>* * <PMID>12859159</PMID>* * </reference>* * <reference>* * <citation>Cauley JA, Robbins J, Chen Z, Cummings SR, Jackson RD, LaCroix AZ, LeBoff M, Lewis CE, McGowan J, Neuner J, Pettinger M, Stefanick ML, Wactawski-Wende J, Watts NB; Women's Health Initiative Investigators. Effects of estrogen plus progestin on risk of fracture and bone mineral density: the Women's Health Initiative randomized trial. JAMA. 2003 Oct 1;290(13):1729-38.</citation>* * <PMID>14519707</PMID>* * </reference>* * <reference>* * <citation>Anderson GL, Judd HL, Kaunitz AM, Barad DH, Beresford SA, Pettinger M, Liu J, McNeeley SG, Lopez AM; Women's Health Initiative Investigators. Effects of estrogen plus progestin on gynecologic cancers and associated diagnostic procedures: the Women's Health Initiative randomized trial. JAMA. 2003 Oct 1;290(13):1739-48.</citation>* * <PMID>14519708</PMID>* * </reference>* * <reference>* * <citation>Chen Z, Pettinger MB, Ritenbaugh C, LaCroix AZ, Robbins J, Caan BJ, Barad DH, Hakim IA. Habitual tea consumption and risk of osteoporosis: a prospective study in the women's health initiative observational cohort. Am J Epidemiol. 2003 Oct 15;158(8):772-81.</citation>* * <PMID>14561667</PMID>* * </reference>* * <reference>* * <citation>Hsia J, Criqui MH, Rodabough RJ, Langer RD, Resnick HE, Phillips LS, Allison M, Bonds DE, Masaki K, Caralis P, Kotchen JM; Women's Health Initiative Investigators. Estrogen plus progestin and the risk of peripheral arterial disease: the Women's Health Initiative. Circulation. 2004 Feb 10;109(5):620-6.</citation>* * <PMID>14769684</PMID>* * </reference>* * <reference>* * <citation>Chlebowski RT, Wactawski-Wende J, Ritenbaugh C, Hubbell FA, Ascensao J, Rodabough RJ, Rosenberg CA, Taylor VM, Harris R, Chen C, Adams-Campbell LL, White E; Women's Health Initiative Investigators. Estrogen plus progestin and colorectal cancer in postmenopausal women. N Engl J Med. 2004 Mar 4;350(10):991-1004.</citation>* * <PMID>14999111</PMID>* * </reference>* * <reference>* * <citation>Hebert JR, Patterson RE, Gorfine M, Ebbeling CB, St Jeor ST, Chlebowski RT. Differences between estimated caloric requirements and self-reported caloric intake in the women's health initiative. Ann Epidemiol. 2003 Oct;13(9):629-37.</citation>* * <PMID>14732302</PMID>* * </reference>* * <reference>* * <citation>Ritenbaugh C, Patterson RE, Chlebowski RT, Caan B, Fels-Tinker L, Howard B, Ockene J. The Women's Health Initiative Dietary Modification trial: overview and baseline characteristics of participants. Ann Epidemiol. 2003 Oct;13(9 Suppl):S87-97. No abstract available.</citation>* * <PMID>14575941</PMID>* * </reference>* * <reference>* * <citation>Harris RE, Chlebowski RT, Jackson RD, Frid DJ, Ascenseo JL, Anderson G, Loar A, Rodabough RJ, White E, McTiernan A; Women's Health Initiative. Breast cancer and nonsteroidal anti-inflammatory drugs: prospective results from the Women's Health Initiative. Cancer Res. 2003 Sep 15;63(18):6096-101.</citation>* * <PMID>14522941</PMID>* * </reference>* * <reference>* * <citation>Morimoto LM, White E, Chen Z, Chlebowski RT, Hays J, Kuller L, Lopez AM, Manson J, Margolis KL, Muti PC, Stefanick ML, McTiernan A. Obesity, body size, and risk of postmenopausal breast cancer: the Women's Health Initiative (United States). Cancer Causes Control. 2002 Oct;13(8):741-51.</citation>* * <PMID>12420953</PMID>* * </reference>* * <reference>* * <citation>Hsia J, Aragaki A, Bloch M, LaCroix AZ, Wallace R; WHI Investigators. Predictors of angina pectoris versus myocardial infarction from the Women's Health Initiative Observational Study. Am J Cardiol. 2004 Mar 15;93(6):673-8.</citation>* * <PMID>15019867</PMID>* * </reference>* * <reference>* * <citation>Khurana C, Rosenbaum CG, Howard BV, Adams-Campbell LL, Detrano RC, Klouj A, Hsia J. Coronary artery calcification in black women and white women. Am Heart J. 2003 Apr;145(4):724-9.</citation>* * <PMID>12679771</PMID>* * </reference>* * <reference>* * <citation>Anderson GL, Limacher M, Assaf AR, Bassford T, Beresford SA, Black H, Bonds D, Brunner R, Brzyski R, Caan B, Chlebowski R, Curb D, Gass M, Hays J, Heiss G, Hendrix S, Howard BV, Hsia J, Hubbell A, Jackson R, Johnson KC, Judd H, Kotchen JM, Kuller L, LaCroix AZ, Lane D, Langer RD, Lasser N, Lewis CE, Manson J, Margolis K, Ockene J, O'Sullivan MJ, Phillips L, Prentice RL, Ritenbaugh C, Robbins J, Rossouw JE, Sarto G, Stefanick ML, Van Horn L, Wactawski-Wende J, Wallace R, Wassertheil-Smoller S; Women's Health Initiative Steering Committee. Effects of conjugated equine estrogen in postmenopausal women with hysterectomy: the Women's Health Initiative randomized controlled trial. JAMA. 2004 Apr 14;291(14):1701-12.</citation>* * <PMID>15082697</PMID>* * </reference>* * <reference>* * <citation>Hulley SB, Grady D. The WHI estrogen-alone trial--do things look any better? JAMA. 2004 Apr 14;291(14):1769-71. No abstract available.</citation>* * <PMID>15082705</PMID>* * </reference>* * <reference>* * <citation>Espeland MA, Rapp SR, Shumaker SA, Brunner R, Manson JE, Sherwin BB, Hsia J, Margolis KL, Hogan PE, Wallace R, Dailey M, Freeman R, Hays J; Women's Health Initiative Memory Study. Conjugated equine estrogens and global cognitive function in postmenopausal women: Women's Health Initiative Memory Study. JAMA. 2004 Jun 23;291(24):2959-68.</citation>* * <PMID>15213207</PMID>* * </reference>* * <reference>* * <citation>Shumaker SA, Legault C, Kuller L, Rapp SR, Thal L, Lane DS, Fillit H, Stefanick ML, Hendrix SL, Lewis CE, Masaki K, Coker LH; Women's Health Initiative Memory Study. Conjugated equine estrogens and incidence of probable dementia and mild cognitive impairment in postmenopausal women: Women's Health Initiative Memory Study. JAMA. 2004 Jun 23;291(24):2947-58.</citation>* * <PMID>15213206</PMID>* * </reference>* * <reference>* * <citation>Schneider LS. Estrogen and dementia: insights from the Women's Health Initiative Memory Study. JAMA. 2004 Jun 23;291(24):3005-7. No abstract available.</citation>* * <PMID>15213214</PMID>* * </reference>* * <reference>* * <citation>Pradhan AD, LaCroix AZ, Langer RD, Trevisan M, Lewis CE, Hsia JA, Oberman A, Kotchen JM, Ridker PM. Tissue plasminogen activator antigen and D-dimer as markers for atherothrombotic risk among healthy postmenopausal women. Circulation. 2004 Jul 20;110(3):292-300. Epub 2004 Jul 06.</citation>* * <PMID>15238458</PMID>* * </reference>* * <reference>* * <citation>Fouad MN, Corbie-Smith G, Curb D, Howard BV, Mouton C, Simon M, Talavera G, Thompson J, Wang CY, White C, Young R. Special populations recruitment for the Women's Health Initiative: successes and limitations. Control Clin Trials. 2004 Aug;25(4):335-52.</citation>* * <PMID>15296809</PMID>* * </reference>* * <reference>* * <citation>Women's Health Initiative Steering Committee. Effects of Conjugated Equine Estrogen in Postmenopausal Women Having Undergone Hysterectomy: The Women's Health Initiative Randomized, Controlled Trials. Obstet Gynecol Surv. 2004 Aug;59(8):599-600.</citation>* * <PMID>15277894</PMID>* * </reference>* * <reference>* * <citation>Margolis KL, Bonds DE, Rodabough RJ, Tinker L, Phillips LS, Allen C, Bassford T, Burke G, Torrens J, Howard BV. Effect of oestrogen plus progestin on the incidence of diabetes in postmenopausal women: results from the Women's Health Initiative Hormone Trial. Diabetologia. 2004 Jul;47(7):1175-87. Epub 2004 Jul 14.</citation>* * <PMID>15252707</PMID>* * </reference>* * <reference>* * <citation>Naftolin F, Taylor HS, Karas R, Brinton E, Newman I, Clarkson TB, Mendelsohn M, Lobo RA, Judelson DR, Nachtigall LE, Heward CB, Hecht H, Jaff MR, Harman SM; Women's Health Initiative. The Women's Health Initiative could not have detected cardioprotective effects of starting hormone therapy during the menopausal transition. Fertil Steril. 2004 Jun;81(6):1498-501.</citation>* * <PMID>15193467</PMID>* * </reference>* * <reference>* * <citation>Chen Z, Kooperberg C, Pettinger MB, Bassford T, Cauley JA, LaCroix AZ, Lewis CE, Kipersztok S, Borne C, Jackson RD. Validity of self-report for fractures among a multiethnic cohort of postmenopausal women: results from the Women's Health Initiative observational study and clinical trials. Menopause. 2004 May-Jun;11(3):264-74.</citation>* * <PMID>15167305</PMID>* * </reference>* * <reference>* * <citation>Women's Health Initiative Study Group. Dietary adherence in the Women's Health Initiative Dietary Modification Trial. J Am Diet Assoc. 2004 Apr;104(4):654-8.</citation>* * <PMID>15054353</PMID>* * </reference>* * <reference>* * <citation>Cushman M, Kuller LH, Prentice R, Rodabough RJ, Psaty BM, Stafford RS, Sidney S, Rosendaal FR; Women's Health Initiative Investigators. Estrogen plus progestin and risk of venous thrombosis. JAMA. 2004 Oct 6;292(13):1573-80.</citation>* * <PMID>15467059</PMID>* * </reference>* * <reference>* * <citation>Heckbert SR, Kooperberg C, Safford MM, Psaty BM, Hsia J, McTiernan A, Gaziano JM, Frishman WH, Curb JD. Comparison of Self-Report, Hospital Discharge Codes, and Adjudication of Cardiovascular Events in the Women's Health Initiative. Am J Epidemiol. 2004 Dec 15;160(12):1152-8.</citation>* * <PMID>15583367</PMID>* * </reference>* * <reference>* * <citation>Wassertheil-Smoller S, Psaty B, Greenland P, Oberman A, Kotchen T, Mouton C, Black H, Aragaki A, Trevisan M. Association between cardiovascular outcomes and antihypertensive drug treatment in older women. JAMA. 2004 Dec 15;292(23):2849-59.</citation>* * <PMID>15598916</PMID>* * </reference>* * <reference>* * <citation>Espeland MA, Gu L, Masaki KH, Langer RD, Coker LH, Stefanick ML, Ockene J, Rapp SR. Association between Reported Alcohol Intake and Cognition: Results from the Women's Health Initiative Memory Study. Am J Epidemiol. 2005 Feb 1;161(3):228-38.</citation>* * <PMID>15671255</PMID>* * </reference>* * <reference>* * <citation>Howard BV, Adams-Campbell L, Allen C, Black H, Passaro M, Rodabough RJ, Rodriguez BL, Safford M, Stevens VJ, Wagenknecht LE. Insulin resistance and weight gain in postmenopausal women of diverse ethnic groups. Int J Obes Relat Metab Disord. 2004 Aug;28(8):1039-47.</citation>* * <PMID>15254486</PMID>* * </reference>* * <reference>* * <citation>Hendrix SL, Cochrane BB, Nygaard IE, Handa VL, Barnabei VM, Iglesia C, Aragaki A, Naughton MJ, Wallace RB, McNeeley SG. Effects of estrogen with and without progestin on urinary incontinence. JAMA. 2005 Feb 23;293(8):935-48.</citation>* * <PMID>15728164</PMID>* * </reference>* * <reference>* * <citation>Curb JD, McTiernan A, Heckbert SR, Kooperberg C, Stanford J, Nevitt M, Johnson KC, Proulx-Burns L, Pastore L, Criqui M, Daugherty S; WHI Morbidity and Mortality Committee. Outcomes ascertainment and adjudication methods in the Women's Health Initiative. Ann Epidemiol. 2003 Oct;13(9 Suppl):S122-8. No abstract available.</citation>* * <PMID>14575944</PMID>* * </reference>* * <reference>* * <citation>Hsia J, Wu L, Allen C, Oberman A, Lawson WE, Torrens J, Safford M, Limacher MC, Howard BV; Women's Health Initiative Research Group. Physical activity and diabetes risk in postmenopausal women. Am J Prev Med. 2005 Jan;28(1):19-25.</citation>* * <PMID>15626551</PMID>* * </reference>* * <reference>* * <citation>Cirillo DJ, Wallace RB, Rodabough RJ, Greenland P, LaCroix AZ, Limacher MC, Larson JC. Effect of estrogen therapy on gallbladder disease. JAMA. 2005 Jan 19;293(3):330-9.</citation>* * <PMID>15657326</PMID>* * </reference>* * <reference>* * <citation>Margolis KL, Manson JE, Greenland P, Rodabough RJ, Bray PF, Safford M, Grimm RH Jr, Howard BV, Assaf AR, Prentice R; Women's Health Initiative Research Group. Leukocyte count as a predictor of cardiovascular events and mortality in postmenopausal women: the Women's Health Initiative Observational Study. Arch Intern Med. 2005 Mar 14;165(5):500-8.</citation>* * <PMID>15767524</PMID>* * </reference>* * <reference>* * <citation>Howard BV, Kuller L, Langer R, Manson JE, Allen C, Assaf A, Cochrane BB, Larson JC, Lasser N, Rainford M, Van Horn L, Stefanick ML, Trevisan M. Risk of cardiovascular disease by hysterectomy status, with and without oophorectomy: the Women's Health Initiative Observational Study. Circulation. 2005 Mar 29;111(12):1462-70. Epub 2005 Mar 21.</citation>* * <PMID>15781742</PMID>* * </reference>* * <reference>* * <citation>Langer RD, Pradhan AD, Lewis CE, Manson JE, Rossouw JE, Hendrix SL, LaCroix AZ, Ridker PM. Baseline associations between postmenopausal hormone therapy and inflammatory, haemostatic, and lipid biomarkers of coronary heart disease. The Women's Health Initiative Observational Study. Thromb Haemost. 2005 Jun;93(6):1108-16.</citation>* * <PMID>15968396</PMID>* * </reference>* * <reference>* * <citation>Ockene JK, Barad DH, Cochrane BB, Larson JC, Gass M, Wassertheil-Smoller S, Manson JE, Barnabei VM, Lane DS, Brzyski RG, Rosal MC, Wylie-Rosett J, Hays J. Symptom experience after discontinuing use of estrogen plus progestin. JAMA. 2005 Jul 13;294(2):183-93.</citation>* * <PMID>16014592</PMID>* * </reference>* * <reference>* * <citation>Chen Z, Maricic M, Bassford TL, Pettinger M, Ritenbaugh C, Lopez AM, Barad DH, Gass M, Leboff MS. Fracture risk among breast cancer survivors: results from the Women's Health Initiative Observational Study. Arch Intern Med. 2005 Mar 14;165(5):552-8.</citation>* * <PMID>15767532</PMID>* * </reference>* * <reference>* * <citation>Fugate Woods N, Lacroix AZ, Gray SL, Aragaki A, Cochrane BB, Brunner RL, Masaki K, Murray A, Newman AB. Frailty: Emergence and Consequences in Women Aged 65 and Older in the Women's Health Initiative Observational Study. J Am Geriatr Soc. 2005 Aug;53(8):1321-30.</citation>* * <PMID>16078957</PMID>* * </reference>* * <reference>* * <citation>Michael YL, Perrin N, Bowen D, Cochrane BB, Wisdom JP, Brzyski R, Ritenbaugh C. Expression and ambivalence over expression of negative emotion:psychometric analysis in the Women's Health Initiative. J Women Aging. 2005;17(1-2):5-18.</citation>* * <PMID>15914416</PMID>* * </reference>* * <reference>* * <citation>Barnabei VM, Cochrane BB, Aragaki AK, Nygaard I, Williams RS, McGovern PG, Young RL, Wells EC, O'Sullivan MJ, Chen B, Schenken R, Johnson SR; Women's Health Initiative Investigators. Menopausal symptoms and treatment-related effects of estrogen and progestin in the Women's Health Initiative. Obstet Gynecol. 2005 May;105(5 Pt 1):1063-73.</citation>* * <PMID>15863546</PMID>* * </reference>* * <reference>* * <citation>Stefanick ML, Prentice RL, Anderson G, Gass M, Manson JE, Hendrix SL, Vista-Deck D, McNeeley G; Women's Health Initiative Steering Committee. Reanalysis of the Women's Health Initiative oral contraceptive data reveals no evidence of delayed cardiovascular benefit. Fertil Steril. 2005 Apr;83(4):853-4. Review.</citation>* * <PMID>15820789</PMID>* * </reference>* * <reference>* * <citation>Brunner RL, Gass M, Aragaki A, Hays J, Granek I, Woods N, Mason E, Brzyski R, Ockene J, Assaf A, LaCroix A, Matthews K, Wallace R; Women's Health Initiative Investigators. Effects of conjugated equine estrogen on health-related quality of life in postmenopausal women with hysterectomy: results from the Women's Health Initiative Randomized Clinical Trial. Arch Intern Med. 2005 Sep 26;165(17):1976-86.</citation>* * <PMID>16186467</PMID>* * </reference>* * <reference>* * <citation>Wampler NS, Chen Z, Jacobsen C, Henderson JA, Howard BV, Rossouw JE. Bone mineral density of American Indian and Alaska Native women compared with non-Hispanic white women: results from the Women's Health Initiative Study. Menopause. 2005 September/October;12(5):536-544. Epub 2005 Sep 1.</citation>* * <PMID>16145307</PMID>* * </reference>* * <reference>* * <citation>Prentice RL, Langer R, Stefanick ML, Howard BV, Pettinger M, Anderson G, Barad D, Curb JD, Kotchen J, Kuller L, Limacher M, Wactawski-Wende J. Combined Postmenopausal Hormone Therapy and Cardiovascular Disease: Toward Resolving the Discrepancy between Observational Studies and the Women's Health Initiative Clinical Trial. Am J Epidemiol. 2005 Sep 1;162(5):404-14. Epub 2005 Jul 20.</citation>* * <PMID>16033876</PMID>* * </reference>* * <reference>* * <citation>Chen Z, Bassford T, Green SB, Cauley JA, Jackson RD, LaCroix AZ, Leboff M, Stefanick ML, Margolis KL. Postmenopausal hormone therapy and body composition--a substudy of the estrogen plus progestin trial of the Women's Health Initiative. Am J Clin Nutr. 2005 Sep;82(3):651-6.</citation>* * <PMID>16155280</PMID>* * </reference>* * <reference>* * <citation>Wolf RL, Cauley JA, Pettinger M, Jackson R, Lacroix A, Leboff MS, Lewis CE, Nevitt MC, Simon JA, Stone KL, Wactawski-Wende J. Lack of a relation between vitamin and mineral antioxidants and bone mineral density: results from the Women's Health Initiative. Am J Clin Nutr. 2005 Sep;82(3):581-8.</citation>* * <PMID>16155271</PMID>* * </reference>* * <reference>* * <citation>Barad D, Kooperberg C, Wactawski-Wende J, Liu J, Hendrix SL, Watts NB. Prior oral contraception and postmenopausal fracture: a Women's Health Initiative observational cohort study. Fertil Steril. 2005 Aug;84(2):374-83.</citation>* * <PMID>16084878</PMID>* * </reference>* * <reference>* * <citation>Howard BV, Manson JE, Stefanick ML, Beresford SA, Frank G, Jones B, Rodabough RJ, Snetselaar L, Thomson C, Tinker L, Vitolins M, Prentice R. Low-fat dietary pattern and weight change over 7 years: the Women's Health Initiative Dietary Modification Trial. JAMA. 2006 Jan 4;295(1):39-49.</citation>* * <PMID>16391215</PMID>* * </reference>* * <results_reference>* * <citation>Stefanick ML, Anderson GL, Margolis KL, Hendrix SL, Rodabough RJ, Paskett ED, Lane DS, Hubbell FA, Assaf AR, Sarto GE, Schenken RS, Yasmeen S, Lessin L, Chlebowski RT; WHI Investigators. Effects of conjugated equine estrogens on breast cancer and mammography screening in postmenopausal women with hysterectomy. JAMA. 2006 Apr 12;295(14):1647-57.</citation>* * <PMID>16609086</PMID>* * </results_reference>* * <results_reference>* * <citation>Jackson RD, LaCroix AZ, Gass M, Wallace RB, Robbins J, Lewis CE, Bassford T, Beresford SA, Black HR, Blanchette P, Bonds DE, Brunner RL, Brzyski RG, Caan B, Cauley JA, Chlebowski RT, Cummings SR, Granek I, Hays J, Heiss G, Hendrix SL, Howard BV, Hsia J, Hubbell FA, Johnson KC, Judd H, Kotchen JM, Kuller LH, Langer RD, Lasser NL, Limacher MC, Ludlam S, Manson JE, Margolis KL, McGowan J, Ockene JK, O'Sullivan MJ, Phillips L, Prentice RL, Sarto GE, Stefanick ML, Van Horn L, Wactawski-Wende J, Whitlock E, Anderson GL, Assaf AR, Barad D; Women's Health Initiative Investigators. Calcium plus vitamin D supplementation and the risk of fractures. N Engl J Med. 2006 Feb 16;354(7):669-83. Erratum in: N Engl J Med. 2006 Mar 9;354(10):1102.</citation>* * <PMID>16481635</PMID>* * </results_reference>* * <results_reference>* * <citation>Howard BV, Van Horn L, Hsia J, Manson JE, Stefanick ML, Wassertheil-Smoller S, Kuller LH, LaCroix AZ, Langer RD, Lasser NL, Lewis CE, Limacher MC, Margolis KL, Mysiw WJ, Ockene JK, Parker LM, Perri MG, Phillips L, Prentice RL, Robbins J, Rossouw JE, Sarto GE, Schatz IJ, Snetselaar LG, Stevens VJ, Tinker LF, Trevisan M, Vitolins MZ, Anderson GL, Assaf AR, Bassford T, Beresford SA, Black HR, Brunner RL, Brzyski RG, Caan B, Chlebowski RT, Gass M, Granek I, Greenland P, Hays J, Heber D, Heiss G, Hendrix SL, Hubbell FA, Johnson KC, Kotchen JM. Low-fat dietary pattern and risk of cardiovascular disease: the Women's Health Initiative Randomized Controlled Dietary Modification Trial. JAMA. 2006 Feb 8;295(6):655-66.</citation>* * <PMID>16467234</PMID>* * </results_reference>* * <results_reference>* * <citation>Beresford SA, Johnson KC, Ritenbaugh C, Lasser NL, Snetselaar LG, Black HR, Anderson GL, Assaf AR, Bassford T, Bowen D, Brunner RL, Brzyski RG, Caan B, Chlebowski RT, Gass M, Harrigan RC, Hays J, Heber D, Heiss G, Hendrix SL, Howard BV, Hsia J, Hubbell FA, Jackson RD, Kotchen JM, Kuller LH, LaCroix AZ, Lane DS, Langer RD, Lewis CE, Manson JE, Margolis KL, Mossavar-Rahmani Y, Ockene JK, Parker LM, Perri MG, Phillips L, Prentice RL, Robbins J, Rossouw JE, Sarto GE, Stefanick ML, Van Horn L, Vitolins MZ, Wactawski-Wende J, Wallace RB, Whitlock E. Low-fat dietary pattern and risk of colorectal cancer: the Women's Health Initiative Randomized Controlled Dietary Modification Trial. JAMA. 2006 Feb 8;295(6):643-54.</citation>* * <PMID>16467233</PMID>* * </results_reference>* * <results_reference>* * <citation>Prentice RL, Caan B, Chlebowski RT, Patterson R, Kuller LH, Ockene JK, Margolis KL, Limacher MC, Manson JE, Parker LM, Paskett E, Phillips L, Robbins J, Rossouw JE, Sarto GE, Shikany JM, Stefanick ML, Thomson CA, Van Horn L, Vitolins MZ, Wactawski-Wende J, Wallace RB, Wassertheil-Smoller S, Whitlock E, Yano K, Adams-Campbell L, Anderson GL, Assaf AR, Beresford SA, Black HR, Brunner RL, Brzyski RG, Ford L, Gass M, Hays J, Heber D, Heiss G, Hendrix SL, Hsia J, Hubbell FA, Jackson RD, Johnson KC, Kotchen JM, LaCroix AZ, Lane DS, Langer RD, Lasser NL, Henderson MM. Low-fat dietary pattern and risk of invasive breast cancer: the Women's Health Initiative Randomized Controlled Dietary Modification Trial. JAMA. 2006 Feb 8;295(6):629-42.</citation>* * <PMID>16467232</PMID>* * </results_reference>* * <results_reference>* * <citation>Howard BV, Manson JE, Stefanick ML, Beresford SA, Frank G, Jones B, Rodabough RJ, Snetselaar L, Thomson C, Tinker L, Vitolins M, Prentice R. Low-fat dietary pattern and weight change over 7 years: the Women's Health Initiative Dietary Modification Trial. JAMA. 2006 Jan 4;295(1):39-49.</citation>* * <PMID>16391215</PMID>* * </results_reference>* * <results_reference>* * <citation>Yasmeen S, Romano PS, Pettinger M, Johnson SR, Hubbell FA, Lane DS, Hendrix SL. Incidence of cervical cytological abnormalities with aging in the women's health initiative: a randomized controlled trial. Obstet Gynecol. 2006 Aug;108(2):410-9.</citation>* * <PMID>16880313</PMID>* * </results_reference>* * <verification_date>January 2006</verification_date>* * <lastchanged_date>November 27, 2006</lastchanged_date>* * <firstreceived_date>October 27, 1999</firstreceived_date>* * <has_expanded_access>No</has_expanded_access>* * <condition_browse>* * <!-- CAUTION: The following MeSH terms are assigned with an imperfect algorithm -->* * <mesh_term>Bone Diseases</mesh_term>* * <mesh_term>Breast Neoplasms</mesh_term>* * <mesh_term>Neoplasms</mesh_term>* * <mesh_term>Cardiovascular Diseases</mesh_term>* * <mesh_term>Colonic Neoplasms</mesh_term>* * <mesh_term>Myocardial Ischemia</mesh_term>* * <mesh_term>Coronary Artery Disease</mesh_term>* * <mesh_term>Coronary Disease</mesh_term>* * <mesh_term>Heart Diseases</mesh_term>* * <mesh_term>Ischemia</mesh_term>* * <mesh_term>Osteoporosis</mesh_term>* * </condition_browse>* * <intervention_browse>* * <!-- CAUTION: The following MeSH terms are assigned with an imperfect algorithm -->* * <mesh_term>Vitamin D</mesh_term>* * <mesh_term>Vitamins</mesh_term>* * <mesh_term>Estrogens</mesh_term>* * <mesh_term>Hormones</mesh_term>* * <mesh_term>Progestins</mesh_term>* * </intervention_browse>* * <!-- Results have not yet been posted for this study -->* *</clinical_study>* *I have written the below script by considering one XML file but this is not working as per requirement since it generating many small file and I dont know how merge them to make one.* *Below is my Pig script.* *register piggybank.jar;A = load 'piglab/NCT00000611.xml' using org.apache.pig.piggybank.storage.XMLLoader('id_info')as (x: chararray);B = foreach A GENERATE FLATTEN(REGEX_EXTRACT_ALL(x, '<id_info>\\n\\s*<org_study_id>(.*)</org_study_id>\\n\\s*<nct_id>(.*)</nct_id>\\n\\s*</id_info>')) as (org_study_id: chararray,nct_id : chararray);C = foreach B GENERATE CONCAT('1$',CONCAT(CONCAT(org_study_id,'$'),nct_id));STORE C into 'piglab/result1';data = load 'piglab/result1' USING PigStorage('$') as (a1: int,a2: chararray,a3: chararray);A1 = load 'piglab/NCT00000611.xml' using org.apache.pig.piggybank.storage.XMLLoader('lead_sponsor')as (y: chararray);B1 = foreach A1 GENERATE FLATTEN(REGEX_EXTRACT_ALL(y, '<lead_sponsor>\\n\\s*<agency>(.*)</agency>\\n\\s*<agency_class>(.*)</agency_class>\\n\\s*</lead_sponsor>')) as (agency: chararray,agency_class: chararray);D = foreach B1 GENERATE CONCAT('1$',CONCAT(CONCAT(agency,'$'),agency_class));STORE D into 'piglab/result2';data1 = load 'piglab/result2' USING PigStorage('$') as (b1: int,b2: chararray,b3: chararray);result= JOIN data by a1,data1 by b1 ;store result into 'piglab/result' USING PigStorage('$');If you can give me one sample PIG script which parses such nested XML file then I can go forward with that.* Any help on this would be greatly appreciable. Thanks Haider > >
