f-treeGC: a questionnaire-based family tree-creation software for genetic counseling and genome cohort studies
BMC Medical Genetics volume 18, Article number: 71 (2017)
The Tohoku Medical Megabank project aims to create a next-generation personalized healthcare system by conducting large-scale genome-cohort studies involving three generations of local residents in the areas affected by the Great East Japan Earthquake. We collected medical and genomic information for developing a biobank to be used for this healthcare system. We designed a questionnaire-based pedigree-creation software program named “f-treeGC,” which enables even less experienced medical practitioners to accurately and rapidly collect family health history and create pedigree charts.
f-treeGC may be run on Adobe AIR. Pedigree charts are created in the following manner: 1) At system startup, the client is prompted to provide required information on the presence or absence of children; f-treeGC is capable of creating a pedigree up to three generations. 2) An interviewer fills out a multiple-choice questionnaire on genealogical information. 3) The information requested includes name, age, gender, general status, infertility status, pregnancy status, fetal status, and physical features or health conditions of individuals over three generations. In addition, information regarding the client and the proband, and birth order information, including multiple gestation, custody, multiple individuals, donor or surrogate, adoption, and consanguinity may be included. 4) f-treeGC shows only marriages between first cousins via the overlay function. 5) f-treeGC automatically creates a pedigree chart, and the chart-creation process is visible for inspection on the screen in real time. 6) The genealogical data may be saved as a file in the original format. The created/modified date and time may be changed as required, and the file may be password-protected and/or saved in read-only format. To enable sorting or searching from the database, the file name automatically contains the terms typed into the entry fields, including physical features or health conditions, by default. 7) Alternatively, family histories are collected using a completed foldable interview paper sheet named “f-sheet”, which is identical to the questionnaire in f-treeGC.
We developed a questionnaire-based family tree-creation software, named f-treeGC, which is fully compliant with international recommendations for standardized human pedigree nomenclature. The present software simplifies the process of collecting family histories and pedigrees, and has a variety of uses, from genome cohort studies or primary care to genetic counseling.
Genealogical information is critical for accurate genetic diagnosis in clinical genetics. In 1995, the National Society of Genetic Counselors  introduced a standardized description method for presenting a family tree in genetic counseling, which was revised in 2008 to the current version . Typically, pedigree charts are manually created via face-to-face personal interviews held on an individual basis. However, pedigree chart creation requires graphical skills and specialized knowledge of clinical genetics, and is thus a time- and labor-intensive process. A large amount of genetic data has been collected from numerous large-scale studies conducted in recent years, such as genome-cohort studies. The availability of accurate genealogical information is essential for performing comprehensive analysis of polymorphisms and associated diseases for use in genetic counseling, research, and diagnosis. Given the large amount of genetic information required and collected, it is currently not possible to create pedigree charts using the traditional method through interviews at a single recruiting event by an expert.
The Tohoku Medical Megabank (TMM) project  aims to restore community medical services that were negatively affected by the Great East Japan Earthquake, and to create a next-generation personalized healthcare system by conducting large-scale genome-cohort studies involving three generations of local residents in the disaster-stricken areas . Specifically, we collected medical and genomic information, including family health history, for developing a biobank to be used in the planned healthcare system.
In this project, we designed a questionnaire-based pedigree-drawing software program named “f-treeGC”, which enables even less-experienced medical practitioners to accurately and rapidly collect genealogical information and create pedigree charts, in full compliance with international standards .
f-treeGC is written in ActionScript 3.0, and may be run on Adobe AIR, which is a cross-platform runtime system.
f-treeGC is supported by both Windows (Windows 7, 8, and 10) and Macintosh (operating system (OS) X). Adobe AIR Runtime  must be installed before installation and use of f-treeGC, and Adobe Reader DC  is required for the printing function. Via these programs, the f-treeGC air file may be opened to install the software. The f-treeGC software program is available for use, at no monetary cost, at the Iwate Medical University Hospital website (http://www.iwate-med.ac.jp/hospital/clinics/medical/m26/).
The method for the creation of pedigree charts is described in the following sections.
Confirmation of whether or not the client has a child
At system startup, the client is prompted to provide required information on the presence or absence of children (Additional file 1a). f-treeGC is capable of creating a pedigree that includes three generations (Fig. 1a). Couples with offspring are included in the 2nd generation of the family tree (Fig. 2a, Additional file 2), whereas clients with no children are included in the 3rd generation (Figs. 1a and 2b, Additional file 3). A representative correspondence table is shown in Table 1.
The genealogical information requested includes name, age, gender, general status (affected, asymptomatic/presymptomatic carrier, carrier, or deceased), infertility status, pregnancy status, fetal status, and health status (occurrence of any diseases) of individuals in the three generations. In addition, information regarding client and proband, as well as birth order information such as multiple gestation, custody, multiple individuals, donor or surrogate, adoption, and consanguinity may be included (Fig. 1).
By default, the “Name” field refers to the type of relationship such as father or mother. The user should delete the relationship name before inputting the relevant name (Fig. 1b). In the “Age” box (Fig. 1c) and the “Gestational age (weeks)” box (Fig. 1d), the user may select “In blank”, “? (unknown)”, or the relevant number. Inputting the same number of individuals in the “Multiple-gestation ID” (Fig. 1e) indicates multiple gestation or pregnancy with multiple fetuses. The “Monozygotic” box (Fig. 1f) is for identical twins (pregnancy). The “Custody” box (Fig. 1g) is for the position of a break in a relationship line between divorced partners, and indicates the parent(s) with primary responsibility for the children following divorce. For multiple individuals, users may select “n (unknown)” or the relevant number(s) after checking the “Multiple individuals” box (Fig. 1h). The “Affected” button of general status (Fig. 1i) is for affected individuals, and users may set a key color for the affected status in the configuration (Fig. 1j). For affected individuals with two or more conditions, the user may check for the conditions (Fig. 1k). Considering the diversity in color perception and to enable distinction in subsequent black-and-white photocopies, f-treeGC shows multiple conditions using a color of a similar shade (Additional file 1b).
A printed paper version of the questionnaire
Alternatively, family health histories may be collected using the foldable interview sheet named “f-sheet” (Fig. 3, Additional file 4), which is a printed paper version of the questionnaire in f-treeGC. The f-sheet provides an overview of genetic relationships between families according to the manner in which the sheet is folded or developed. For example, the vertical line of the folded f-sheet corresponds to the left panel of the f-treeGC (Fig. 1a). Filling out a multiple-choice questionnaire on genetic information is easier for a medical practitioner with poor digital literacy. A skilled data entry clerk may subsequently input family health histories into f-treeGC from the f-sheet as a bundle.
Overlay function for consanguinity
With respect to consanguinity, f-treeGC shows only marriages between first cousins using the overlay function. The “Consanguinity” box should be checked and the same “Overlay ID” should be entered for this function to be effective (Figs. 1l and 4, Additional file 5).
Entry fields and keys for physical features or diseases/conditions for genome cohort studies
By default, the file name automatically contains the first ten letters (Fig. 1m) that are typed into the sixteen entry fields of the “Disease, etc./KEY,” such as disease name (Fig. 1n), and for which the box is checked (Fig. 1o), to enable sorting or searching from the database. For example, the preliminary physical features entered are hair, ears, eyes, nose, philtrum, oral region, neck, hands/feet, chest, skin, abdominal, genitalia, and skeletal . Diseases or conditions included in the My Family Health Portrait tool  include cancer, clotting disorder, dementia/Alzheimer’s disease, diabetes, gastrointestinal disorder, heart disease, high cholesterol, hypertension, kidney disease, lung disease, osteoporosis, psychological disorder, septicemia, stroke/brain attack, sudden infant death syndrome, and unknown disease (Fig. 1o). The first four out of sixteen entry fields may also be used as keys related to the multiple conditions of an affected individual (Fig. 1k, p). The “Show below symbol” box is for showing the entry below each symbol, only if the box on the left of the entry field is checked (Fig. 1q).
Pedigree chart creation
f-treeGC automatically creates a pedigree chart; the chart-creation process is visible for inspection on the screen in real time (Fig. 1r). The “Show the PEDIGREE” button (Fig. 1s) at the top of the screen displays a larger chart (Fig. 2b). The user may additionally edit a pedigree chart to visualize the larger chart with multiple displays.
A context menu appears when the family tree is right-clicked. The user may select “Add comment,” right-click on the comment box, and then type the appropriate text. Holding down the left mouse button allows the comment in the family tree to be moved and placed accordingly. The user may delete the comment, if required, by selecting “Delete comment”. By default, a pedigree chart does not include the name input (Fig. 1b, t) for privacy. The user may see names by selecting “Show names” in the context menu. To print or save an image of the pedigree, including names or other health conditions, the user may capture a screen shot (Fig. 2b, Additional file 3), or input the names manually in the “Remarks” box (Figs. 1u and 2a, Additional file 2).
Saving and reading the data
The genealogical information data may be saved as the original format file (FTGC file) by clicking the “Save the FTGC DATA” button (Fig. 1v), and shared between computers that have f-treeGC installed. The created/modified date and time may be changed as necessary before saving the data (Additional file 1c). In addition, the image of the pedigree chart may be saved in PDF format by clicking the “Save an IMAGE as PDF” button (Figs. 1t and 2a). The file may be password-protected and/or saved in a read-only format (Additional file 1c). A saved file may be read by clicking the “Read the FTGC DATA” button (Fig. 1w) at the top of the screen and selecting the file.
Comparison of the pedigree symbols used
A comparison of the pedigree symbols used in f-treeGC with those of several existing tools [8,9,10,11] is shown in Table 2. f-treeGC complies fully with the international recommendations of standardized human pedigree nomenclature , including common pedigree symbols, pedigree lines, assisted reproductive technology symbols, and pedigree symbols of genetic evaluation/testing information (Additional file 6).
System features and functions of f-treeGC as a family history collection tool
A well-designed online family history tool from the US Surgeon General, called My Family Health Portrait, is used for the collection and storage of family history data [11, 12]. In addition, the MeTree software program developed by the Duke Center for Applied Genomics and Precision Medicine enables collection of family health history and provides clinical decision-making support for more than 30 conditions such as cancer, cardiovascular diseases, liver, and diabetes [12, 13]. The Global Alliance for Genomics and Health (GA4GH) provides the GA4GH family history collection and clinical decision support tool inventory, and is open for submission. The information collected by f-treeGC is shown in Table 3 and in Additional file 7 derived from the submission form of the GA4GH family history tools catalog .
Verification of the software and interview sheet
To verify that f-treeGC enables users without specialized knowledge of clinical genetics and graphical skills to easily create medical pedigrees, we provided nine subjects (six nurses and three clerks) with two scenarios (D, Duchenne muscular dystrophy; P, phenylketonuria) of fictitious family histories (Additional file 8), and compared the pedigrees obtained by f-treeGC (Additional file 9) with those derived manually. The pedigrees were scored according to a system of allocation points (Additional file 8) based on the international standard  to examine the performance and usability of f-treeGC. The creation time was indefinite, and we divided the trees into two groups with different orders of scenarios applied. We used Windows 7 as the OS for this test.
To verify that f-sheet improves the user experience for data input to f-treeGC, we provided 47 high school students from one high school (males, N = 28; females, N = 19; age range, 15–16 years; grade, the first year) with the software and data for two scenarios. Students were randomly assigned to two groups: students of one group created pedigrees for both scenarios (Additional file 8) using f-treeGC without f-sheet (group TT; males, N = 17; females, N = 6). Students of the other group first created pedigrees for scenario D using f-treeGC without f-sheet, and then created pedigrees for scenario P using f-treeGC with a completed f-sheet (group TS; males, N = 11; females, N = 13). The pedigrees with or without f-sheet were scored using our points allocation system (Additional file 8) based on the international standard ; then, the scores were compared to examine the efficacy and usability of f-sheet. The creation time was indefinite. The OS used for this purpose was Windows 7.
The Wilcoxon signed-rank test and Mann-Whitney U-test were performed for statistical analyses, using Statcel4 software (OMS Ltd. Publishing, Saitama, Japan). Significance was set at p < 0.05.
The family trees obtained using f-treeGC had higher scores than those that were manually created (p < 0.001) (Table 4). Moreover, the input time and family tree scores of trees created using f-treeGC were not affected by the difference in scenario content, order of application, or the qualifications of each user (Table 4). Furthermore, the scores of the family trees created using f-treeGC with a completed f-sheet were higher than those created using f-treeGC without f-sheet (p < 0.01) (Table 5).
In the present study, we report the development of f-treeGC, a free stand-alone application built as a cross-platform runtime system. f-treeGC is capable of automatically creating a medical family tree compliant with international standards  by filling out available family tree information on a medical interview sheet (Fig. 1). Family histories are entered as both structured data and free text by the clinician or data entry clerk, and collected from patients or through f-sheet, which is a printed paper version of the questionnaire in f-treeGC (Fig. 3, Additional file 4). The family history data are stored in a computer in the original format file. f-treeGC may be used for collecting family health histories and creating pedigrees for individuals participating in situations such as primary care, genetic counseling, or genome cohort studies. The targeted clinical populations are recipients of primary and specialty health care facilities.
f-treeGC simplifies the process of creating pedigrees by confirming whether the client has offspring at system startup (Table 1, Additional file 1a) and by using the overlay function for confirming consanguinity (Fig. 4). Here, we show that f-treeGC, which is fully compliant with international recommendations for standardized human pedigree nomenclature (Table 2), is highly useful for creating pedigree charts for applications in genetic counseling.
However, the present study is not without limitations. As f-treeGC is only capable of creating a pedigree up to three generations, this software is currently unsuitable for creating large pedigrees. There are no auxiliary input functions for medical terms, pedigree-overlay function, nor a calculator for determining disease risk. f-treeGC is not adapted for compliance with Health-Level 7 (HL7) standards. Although numerous health and medical conditions exist , f-treeGC is limited to only sixteen medical conditions per person.
Low quality of family history data collected presents a challenge in pedigree analysis . Before collecting family health histories, users should guide patients regarding what to inquire of relatives, as the amount and accuracy of the family history is limited. The Iwate Tohoku Medical Megabank Organization conducts genetics workshops mentored by medical geneticists or genetic counselors to highlight the importance of family health history before recruiting participants for cohort studies of the TMM project.
Family history, the ultimate genetic tool , is the most cost-effective and well known “genetic test” in clinical practice today . However, recording family trees according to standard recommendations generally requires knowledge of graphical interfaces and clinical genetics . In 2016, the National Human Genome Research Institute (NHGRI) convened a Family Health History Tool Meeting at the National Institute of Health (NIH) for identifying and sharing successful approaches to using family health history tools, and for identifying unresolved issues and potential solutions that may be addressed by policy, research, and/or collaborative efforts. The removal of barriers to health equality in populations with low levels of literacy, and exploration/expansion additional technological approaches for family health history collection was discussed in this meeting .
Six years have passed since the Great East Japan Earthquake and Tsunami. However, health and medical services has not been fully restored to date. The TMM project initiated two prospective cohort studies in the Miyagi and Iwate prefectures, which include the disaster-stricken areas: a population-based adult cohort study, in which 80,000 participants were recruited, and a birth and three-generation cohort study, in which 70,000 participants, included fetuses and their parents, siblings, grandparents, and extended family members, were recruited . Collection of significant numbers of family health histories by conventional pedigree-drawing software programs is challenging in these regions owing to the lack of good internet service and personal computers. We used f-treeGC for genetic counseling at our institution, collecting approximately 100 patient histories and corresponding data, which would have taken a genetic counselor around twenty minutes in a clinical setting. In contrast, the software took about one minute per person to input two clinical scenarios at the verification experiment (Table 4). Since the use of f-treeGC in combination with f-sheet simplifies the process of collection of many numerous health histories and pedigrees (Table 5), its application is not limited to heredity clinics, but also to large-scale genome-cohort studies that handle large amounts of genetic information obtained through interviews at a single recruiting event.
The main advantages of f-treeGC are collection of several family histories for large-scale cohort studies in a short period of time, easing the burden of collection of genealogical information and creation of pedigree charts in remote medical practice by less experienced medical practitioners. In addition, the present tool facilitates online genetic counseling owing to its complete compliance with the international recommendations for standardized human pedigree nomenclature (Table 2).
Public awareness regarding the basic principles of genetics should be considered for the improvement of public health. Familial/pedigree information is valuable for variant filtering in high-throughput sequencing studies [17, 18]. Molecular approaches for the identification of disease-associated genes generally begin with pedigree-based methods, including positional cloning and founder gene approaches, prior to the use of pedigree-independent methods such as candidate gene approaches and genome-wide association studies . With the recent explosion in whole-genome sequencing, linkage analysis has emerged as an important and powerful analytical method for the identification of genes involved in disease etiology, often in conjunction with whole-genome sequencing filtering approaches . From this perspective, f-treeGC is a useful tool, not only for facile and accurate pedigree analysis, but also for conveniently collecting numerous family histories and pedigrees simultaneously. In future, we aim to add a calculator function for determining disease risk, an auxiliary input function for medical terms, a search function for family health conditions from free text, an adaption for HL7, and a pedigree-overlay function to the present f-treeGC software.
The f-treeGC software enables collection of family health history and automatically creates a medical family tree simply by filling out family tree information on a medical interview sheet, or by inputting the information in the questionnaire directly from the f-sheet.
Availability and requirements
Project name: TMM project
Project home page: http://www.amed.go.jp/en/program/list/04/01/042.html
Operating systems: Windows and Macintosh
Programing language: ActionScript 3.0
Licence: f-treeGC is a non-copylefted software, and is copyrighted by the Iwate Medical University and Holonic Systems, Ltd. The source code is not available.
Global Alliance for Genomics and Health
- TMM project:
Tohoku Medical Megabank project
Bennett RL, Steinhaus KA, Uhrich SB, O'Sullivan CK, Resta RG, Lochner-Doyle D, et al. Recommendations for standardized human pedigree nomenclature. J Genet Couns. 1995;4:267–79.
Bennett RL, French KS, Resta RG, Doyle DL. Standardized human pedigree nomenclature: update and assessment of the recommendations of the National Society of genetic counselors. J Genet Couns. 2008;17:424–33.
Tohoku Medical Megabank project. http://www.amed.go.jp/en/program/list/04/01/042.html. Accessed 14 May 2017.
Kuriyama S, Yaegashi N, Nagami F, Arai T, Kawaguchi Y, Osumi N, et al. The Tohoku Medical Megabank project: design and mission. J Epidemiol. 2016;26:493–511.
Adobe AIR Runtime. http://get.adobe.com/jp/air/. Accessed 14 May 2017.
Adobe Reader DC. http://get.adobe.com/jp/reader/. Accessed 14 May 2017.
Bennett RL. The practical guide to the genetic family history. 2nd ed. Hoboken: Wiley-Blackwell; 2010.
PedigreeXP. PC PAL. https://www.pedigreexp.com/. Accessed 14 May 2017.
Progeny Free Online Pedigree Tool. Progeny Genetics LLC. http://www.progenygenetics.com/online-pedigree/. Accessed 14 May 2017.
Genial Pedigree Draw. Genial Genetic Solutions Ltd. http://www.pedigreedraw.com. Accessed 14 May 2017.
My Family Health Portrait. US Surgeon General. https://familyhistory.hhs.gov/FHH/html/index.html?setLng=en. Accessed 14 May 2017.
Global Alliance for Genomics and Health. http://genomicsandhealth.org/work-products-demonstration-projects/catalogue-global-activities-family-history-tools. Accessed 14 May 2017.
Orlando LA, Buchanan AH, Hahn SE, Christianson CA, Powell KP, Skinner CS, et al. Development and validation of a primary care-based family health history and decision support program (MeTree). N C Med J. 2013;74:287–96.
The NHGRI Family Health History Tool Meeting 2016. https://www.genome.gov/27565264/the-nih-family-health-history-tool-conference-2016/. Accessed 14 May 2017.
Uhlmann WR, Scheuette JL, Yashar BM. A guide to genetic counseling. 2nd ed. Hoboken: Wiley-Blackwell; 2009.
Duke Center for Applied Genomic & Precision Medicine. https://precisionmedicine.duke.edu. Accessed 14 May 2017.
Wijsman EM. The role of large pedigrees in an era of high-throughput sequencing. Hum Genet. 2012;131:1555–63.
Bahlo M, Tankard R, Lukic V, Oliver KL, Smith KR. Using familial information for variant filtering in high-throughput sequencing studies. Hum Genet. 2014;133:1331–41.
Bakhtiar SM, Ali A, Baig SM, Barh D, Miyoshi A, Azevedo V. Identifying human disease genes: advances in molecular genetics and computational approaches. Genet Mol Res. 2014;13:5073–87.
Ott J, Wang J, Leal SM. Genetic linkage analysis in the age of whole-genome sequencing. Nat Rev Genet. 2015;16:275–84.
The authors thank the members of the Iwate Tohoku Medical Megabank Organization for suggestions and comments; Dr. Hiromasa Ono and Dr. Hidemasa Bono, Database Center for Life Science, for manual preparation; the members of Akita IP Firm for valuable suggestions; the members of Editage for English proofreading and valuable comments; Dr. Tohru Ohta, Dr. Rie Takai, and Dr. Norio Niikawa, Health Science University of Hokkaido, for experimental support and valuable comments; and Ms. Fumiyo Nakayama, Iwate Tohoku Medical Megabank Organization, for her assistance in preparing the manuscript; Mr. Yusuke Sakurai, Iwate Medical University Hospital, for generation of the website.
This research is supported by the Ministry of Education, Culture, Sports, Science and Technology (MEXT), Japan Agency for Medical Research and Development (AMED), and Iwate Industry Promotion Center.
Availability of data and materials
The datasets supporting the conclusions are included in this article and in the additional files. The f-treeGC software program is available at no monetary charge at the Iwate Medical University Hospital website, http://www.iwate-med.ac.jp/hospital/clinics/medical/m26/, and the f-sheets are available at no monetary charge as additional files with this article.
f-treeGC is a non-copylefted software, and is copyrighted by the Iwate Medical University and Holonic Systems, Ltd. The source code is not available. f-treeGC is written in ActionScript 3.0, and may be run on Adobe AIR, which is a cross-platform runtime system. f-treeGC is supported by both Windows (Windows 7, 8, and 10) and Macintosh (OS X). An application for the f-treeGC software program is under review for the international Patent Cooperation Treaty (PCT/JP2016/078494).
f-sheet is copyrighted by the Iwate Medical University and registered as a utility model in Japan (3204531).
Ethics approval and consent to participate
Ethical approval for our study was obtained from the institutional review board of the Health Science University of Hokkaido (Approval ID: 174). This study was conducted in accordance with the Declaration of Helsinki, Japanese Act on the Protection of Personal Information, and Japan Ethical Guidelines for Medical and Health Research Involving Human Subjects. We obtained informed consent from parents of all the students enrolled in our study.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Dialog boxes of f-treeGC (a) Confirmation of whether or not the client has a child at system startup, (b) Configuration of the color for affected individuals, (c) File attribute setting for changing the created/modified date and time, setting a password, and converting to read-only. (PPTX 141 kb)
f-sheet; a printed paper version of the questionnaire in f-treeGC. (XLSX 57 kb)
A pedigree file of Fig. 4d; with respect to consanguinity, f-treeGC shows only marriages between first cousins using the overlay function. (FTGC 8 kb)
List of symbols used in f-treeGC; f-treeGC fully complies with the international recommendations of standardized human pedigree nomenclature . (PDF 8 kb)
Current family history collection tools and f-treeGC; partially modified from GA4GH family history collection and clinical decision support tool inventory 6–9-16 v4.1 by Clinical Working Group of the Global Alliance for Genomics and Health. (XLSX 18 kb)
Two scenarios for the creation of pedigrees; scenario D (Duchenne muscular dystrophy) and scenario P (phenylketonuria). Any resemblance to real persons and pedigrees, living or dead, is purely coincidental. (XLSX 12 kb)
Model pedigrees of scenarios outlined (a) Scenario D, (b) scenario P. (PPTX 389 kb)
About this article
Cite this article
Tokutomi, T., Fukushima, A., Yamamoto, K. et al. f-treeGC: a questionnaire-based family tree-creation software for genetic counseling and genome cohort studies. BMC Med Genet 18, 71 (2017). https://doi.org/10.1186/s12881-017-0433-4