1. Annotated protein coding genes:

Group of genes                      Name of genes
Subunits of ATP synthase            atpA, atpB, atpE, atpF, atpH, atpI
Subunits of NADH-dehydrogenase      ndhA, ndhB, ndhB_copy2, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK
Subunits of cytochrome b/f complex  petA, petB, petD, petG, petL, petN
Subunits of photosystem I           psaA, psaB, psaC, psaI, psaJ
Subunits of photosystem II          psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ, ycf3
Large subunit of ribosome           rpl14, rpl16, rpl2, rpl20, rpl22, rpl23, rpl23_copy2, rpl23_copy3, rpl2_copy2, rpl32, rpl33, rpl36
Small subunit of ribosome           rps11, rps12, rps14, rps15, rps15_copy2, rps16, rps18, rps19, rps19_copy2, rps2, rps3, rps4, rps7, rps7_copy2, rps8
DNA dependent RNA polymerase        rpoA, rpoB, rpoC1, rpoC2
Subunit of rubisco                  rbcL
c-type cytochrome synthesis gene    ccsA
Envelope membrane protein           cemA
Maturase                            matK
Protease                            clpP
Subunit of Acetyl-CoA-carboxylase   None
Translational initiation factor     infA
Conserved open reading frames       ycf1, ycf15, ycf15_copy2, ycf1_copy2, ycf4
Other genes                         None

Possible missing protein coding genes:

Group of genes                      Name of genes
Subunits of ATP synthase            None
Subunits of NADH-dehydrogenase      None
Subunits of cytochrome b/f complex  None
Subunits of photosystem I           None
Subunits of photosystem II          None
Large subunit of ribosome           None
Small subunit of ribosome           None
DNA dependent RNA polymerase        None
Subunit of rubisco                  None
c-type cytochrome synthesis gene    None
Envelope membrane protein           None
Maturase                            None
Protease                            None
Subunit of Acetyl-CoA-carboxylase   accD
Translational initiation factor     None
Conserved open reading frames       ycf2
Other genes                         None

2. Multicopy genes with different lengths:

Genes         Length  Location
rpl23         228     [56181:56409](+)
rpl23_copy2   282     [81815:82097](-)
rpl23_copy3   282     [132181:132463](+)
rpl2          840     join{[81397:81809](-), [80312:80740](-)}
rpl2_copy2    837     join{[132469:132878](+), [133537:133965](+)}
ndhB          1626    join{[86526:87396](-), [85151:85907](-)}
ndhB_copy2    1623    join{[126882:127749](+), [128371:129127](+)}

3. Common intron-contained genes:

Genes         exons   exon1               exon2               exon3
atpF          (2/2)   [33198:33357](+)    [34147:34553](+)
clpP          (1/3)   [67306:67956](-)
ndhA          (2/2)   [111744:112296](-)  [110186:110724](-)
ndhB          (2/2)   [86527:87396](-)    [85152:85907](-)
ndhB_copy2    (2/2)   [126883:127749](+)  [128372:129127](+)
petB          (1/2)   [71561:72259](+)
petD          (1/2)   [73157:73681](+)
rpl16         (1/2)   [77040:77450](-)
rpl2          (2/2)   [81398:81809](-)    [80313:80740](-)
rpl2_copy2    (2/2)   [132470:132878](+)  [133538:133965](+)
rpoC1         (1/2)   [22904:24946](+)
rps12         (2/3)   [67018:67164](-)    [88783:89061](-)
rps16         (1/2)   [4433:4648](-)
ycf3          (3/3)   [44116:44242](-)    [43152:43381](-)    [42258:42416](-)

### The exon column shows the number of (actual exons/expected exons). ###

4. Genes with internal stop codon: None

5. Genes with nonstandard start codon:

Genes         Start codon
psbI          ATC
petN          ATT
psbT          ATC
rps19         GTG
rpl2          ATA
rpl2_copy2    ATA
rps19_copy2   GTG

6. Genes with nonstandard stop codon:

Genes         Stop codon
None          None

7. Conclusion

This genome has 128 genes (107 unique genes), including:
87 protein-coding genes (78 are unique)
33 tRNA genes (25 are unique)
8 rRNA genes (4 are unique)

Details are as follows:

87 protein-coding genes (78 are unique)

Genes         Length  Location
atpA          1524    [34649:36172](+)
atpB          1497    [52226:53722](-)
atpE          414     [51816:52229](-)
atpF          567     [33198:33357](+) [34147:34553](+)
atpH          246     [32480:32725](+)
atpI          744     [30939:31682](+)
ccsA          969     [104759:105727](+)
cemA          678     [58182:58859](+)
clpP          651     [67306:67956](-)
infA          279     [75622:75900](-)
matK          1542    [1632:3173](-)
ndhA          1092    [111744:112296](-) [110186:110724](-)
ndhB          1626    [86527:87396](-) [85152:85907](-)
ndhB_copy2    1623    [126883:127749](+) [128372:129127](+)
ndhC          363     [49407:49769](-)
ndhD          1503    [105921:107423](-)
ndhE          306     [108256:108561](-)
ndhF          2220    [100781:103000](-)
ndhG          531     [108758:109288](-)
ndhH          1182    [112298:113479](-)
ndhI          543     [109550:110092](-)
ndhJ          480     [48090:48569](-)
ndhK          747     [48670:49416](-)
petA          963     [59085:60047](+)
petB          699     [71561:72259](+)
petD          525     [73157:73681](+)
petG          114     [63344:63457](+)
petL          96      [63070:63165](+)
petN          96      [17691:17786](-)
psaA          2253    [39469:41721](-)
psaB          2205    [37239:39443](-)
psaC          246     [107543:107788](-)
psaI          111     [56750:56860](+)
psaJ          129     [64267:64395](+)
psbA          1062    [31:1092](-)
psbB          1527    [68412:69938](+)
psbC          1422    [10095:11516](+)
psbD          1062    [9086:10147](+)
psbE          252     [61499:61750](-)
psbF          120     [61369:61488](-)
psbH          222     [70493:70714](+)
psbI          147     [7790:7936](+)
psbJ          123     [60982:61104](-)
psbK          186     [7243:7428](+)
psbL          117     [61230:61346](-)
psbM          105     [16762:16866](+)
psbN          132     [70257:70388](-)
psbT          111     [70094:70204](+)
psbZ          189     [12085:12273](+)
rbcL          1452    [54498:55949](+)
rpl14         372     [76531:76902](-)
rpl16         411     [77040:77450](-)
rpl2          840     [81398:81809](-) [80313:80740](-)
rpl20         360     [66004:66363](-)
rpl22         450     [79246:79695](-)
rpl23         228     [56182:56409](+)
rpl23_copy2   282     [81816:82097](-)
rpl23_copy3   282     [132182:132463](+)
rpl2_copy2    837     [132470:132878](+) [133538:133965](+)
rpl32         165     [103909:104073](+)
rpl33         201     [64805:65005](+)
rpl36         114     [75393:75506](-)
rpoA          750     [73967:74716](-)
rpoB          3228    [19639:22866](+)
rpoC1         2043    [22904:24946](+)
rpoC2         4530    [25146:29675](+)
rps11         432     [74790:75221](-)
rps12         426     [67018:67164](-) [88783:89061](-)
rps14         312     [36782:37093](-)
rps15         273     [100405:100677](+)
rps15_copy2   273     [113602:113874](-)
rps16         216     [4433:4648](-)
rps18         492     [65285:65776](+)
rps19         282     [79780:80061](-)
rps19_copy2   282     [134217:134498](+)
rps2          711     [29978:30688](+)
rps3          675     [78514:79188](-)
rps4          606     [45099:45704](-)
rps7          471     [87696:88166](-)
rps7_copy2    471     [126113:126583](+)
rps8          411     [75980:76390](-)
ycf1          192     [99786:99977](+)
ycf15         192     [89879:90070](-)
ycf15_copy2   192     [124209:124400](+)
ycf1_copy2    192     [114302:114493](-)
ycf3          516     [44116:44242](-) [43152:43381](-) [42258:42416](-)
ycf4          558     [57220:57777](+)

33 tRNA genes (25 are unique)

Genes            Location
trnK-TTT         [3853:3890](-) [1331:1363](-)
trnQ-TTG         [6823:6894](-)
trnS-GCT         [8052:8144](-)
trnS-TGA         [11669:11756](-)
trnG-GCC         [12422:12492](+)
trnfM-CAT        [12883:12956](-)
trnT-GGT         [14837:14908](+)
trnE-TTC         [15410:15482](+)
trnY-GTA         [15544:15627](+)
trnC-GCA         [18559:18629](-)
trnR-TCT         [36334:36405](-)
trnS-GGA         [44754:44840](+)
trnT-TGT         [46006:46078](-)
trnF-GAA         [47456:47528](+)
trnM-CAT         [51631:51703](+)
trnW-CCA         [63581:63654](-)
trnP-TGG         [63785:63858](-)
trnN-GTT         [78368:78442](+)
trnH-GTG         [80194:80267](+)
trnI-CAT         [82272:82345](-)
trnL-CAA         [84506:84586](-)
trnV-GAC         [90817:90888](+)
trnA-TGC         [93769:93806](+) [94617:94651](+)
trnR-ACG         [98479:98552](+)
trnN-GTT_copy2   [98806:98877](-)
trnL-TAG         [104602:104681](+)
trnN-GTT_copy3   [115402:115473](+)
trnR-ACG_copy2   [115727:115800](-)
trnA-TGC_copy2   [120474:120511](-) [119629:119663](-)
trnV-GAC_copy2   [123391:123462](-)
trnL-CAA_copy2   [129693:129773](+)
trnI-CAT_copy2   [131934:132007](+)
trnH-GTG_copy2   [134011:134084](-)

8 rRNA genes (4 are unique)

Genes            Location
rrn16S           [91118:92609](-)
rrn23S           [94797:97684](-)
rrn4.5S          [97780:97874](-)
rrn5S            [98102:98222](+)
rrn5S_copy2      [116058:116178](-)
rrn4.5S_copy2    [116406:116500](+)
rrn23S_copy2     [116596:119483](+)
rrn16S_copy2     [121670:123161](+)

This genome has 11 genes with intron(s). Details are as follows:

10 gene(s) have 1 intron(s)

Genes            Exons
rps12            [67018:67164](-) [88783:89061](-)
atpF             [33198:33357](+) [34147:34553](+)
rpl2             [81398:81809](-) [80313:80740](-)
ndhB             [86527:87396](-) [85152:85907](-)
ndhA             [111744:112296](-) [110186:110724](-)
ndhB_copy2       [126883:127749](+) [128372:129127](+)
rpl2_copy2       [132470:132878](+) [133538:133965](+)
trnK-TTT         [3853:3890](-) [1331:1363](-)
trnA-TGC         [93769:93806](+) [94617:94651](+)
trnA-TGC_copy2   [120474:120511](-) [119629:119663](-)

1 gene(s) have 2 intron(s)

Genes            Exons
ycf3             [44116:44242](-) [43152:43381](-) [42258:42416](-)
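
The Length and Location columns of this report can be cross-checked with a short script. The sketch below is not part of the annotation tool; the names `parse_location` and `total_length` are hypothetical helpers introduced here for illustration. It assumes, based only on the arithmetic of the listed values, that the locations in Section 7 are 1-based and inclusive, while the `join{...}` locations in Section 2 are 0-based and half-open.

```python
import re

# Matches one segment such as "[81398:81809](-)".
SEGMENT = re.compile(r"\[(\d+):(\d+)\]\(([+-])\)")


def parse_location(text):
    """Return (start, end, strand) tuples for every segment in a location string."""
    return [(int(s), int(e), strand) for s, e, strand in SEGMENT.findall(text)]


def total_length(text, one_based_inclusive=True):
    """Sum the segment lengths under the assumed coordinate convention.

    one_based_inclusive=True  -> length = end - start + 1 (assumed for Section 7)
    one_based_inclusive=False -> length = end - start     (assumed for Section 2)
    """
    offset = 1 if one_based_inclusive else 0
    return sum(end - start + offset for start, end, _ in parse_location(text))


# rpl2 as listed in Section 7 (reported length 840).
print(total_length("[81398:81809](-) [80313:80740](-)"))  # 840

# rpl2 as listed in Section 2 (reported length 840).
print(total_length("join{[81397:81809](-), [80312:80740](-)}",
                   one_based_inclusive=False))  # 840

# Gene tallies from the Conclusion: totals and unique counts.
print(87 + 33 + 8)   # 128
print(78 + 25 + 4)   # 107
```

Under these assumptions both coordinate styles give the reported 840 bp for rpl2, and the per-category counts add up to the stated 128 genes (107 unique).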