1. Annotated protein coding genes:

Group of genes                      Name of genes
Subunits of ATP synthase            atpA, atpB, atpE, atpF, atpH, atpI
Subunits of NADH-dehydrogenase      ndhA, ndhB, ndhB_copy2, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK
Subunits of cytochrome b/f complex  petA, petB, petG, petL, petN
Subunits of photosystem I           psaA, psaB, psaC, psaI, psaJ
Subunits of photosystem II          psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ, ycf3
Large subunit of ribosome           rpl14, rpl16, rpl2, rpl20, rpl22, rpl23, rpl23_copy2, rpl2_copy2, rpl33, rpl36
Small subunit of ribosome           rps11, rps12, rps14, rps15, rps16, rps18, rps19, rps2, rps3, rps4, rps7, rps7_copy2, rps8
DNA dependent RNA polymerase        rpoA, rpoB, rpoC1, rpoC2
Subunit of rubisco                  rbcL
c-type cytochrome synthesis gene    ccsA
Envelope membrane protein           cemA
Maturase                            matK
Protease                            clpP
Subunit of Acetyl-CoA-carboxylase   accD
Translational initiation factor     infA
Conserved open reading frames       ycf1, ycf15, ycf15_copy2, ycf2, ycf2_copy2, ycf4
Other genes                         None

Possible missing protein coding genes:

Group of genes                      Name of genes
Subunits of ATP synthase            None
Subunits of NADH-dehydrogenase      None
Subunits of cytochrome b/f complex  petD
Subunits of photosystem I           None
Subunits of photosystem II          None
Large subunit of ribosome           rpl32
Small subunit of ribosome           None
DNA dependent RNA polymerase        None
Subunit of rubisco                  None
c-type cytochrome synthesis gene    None
Envelope membrane protein           None
Maturase                            None
Protease                            None
Subunit of Acetyl-CoA-carboxylase   None
Translational initiation factor     None
Conserved open reading frames       None
Other genes                         None

2. Multicopy genes with different lengths:

Genes       Length      Location
None        None        None

3. Common intron-contained genes:

Genes       exons   exon1               exon2               exon3
atpF        (2/2)   [13125:13269](-)    [11939:12345](-)
clpP        (3/3)   [75824:75894](-)    [74728:75018](-)    [73811:74054](-)
ndhA        (2/2)   [124985:125537](-)  [123347:123885](-)
ndhB        (2/2)   [100388:101162](-)  [98926:99683](-)
ndhB_copy2  (2/2)   [145418:146192](+)  [146897:147654](+)
petB        (1/2)   [79567:80262](+)
rpl16       (1/2)   [85158:85568](-)
rpl2        (2/2)   [89461:89863](-)    [88374:88804](-)
rpl2_copy2  (2/2)   [156717:157119](+)  [157777:158207](+)
rpoC1       (2/2)   [23771:24202](-)    [21381:22997](-)
rps12       (3/3)   [73555:73668](-)    [102573:102804](-)  [102011:102036](-)
rps16       (2/2)   [6244:6284](-)      [5177:5402](-)
ycf3        (3/3)   [47127:47252](-)    [46061:46288](-)    [45157:45303](-)

### The exon column shows the number of (actual exons/expected exons). ###

4. Genes with internal stop codon: None

5. Genes with nonstandard start codon:

Genes       Start codon
cemA        GTG
rpl16       ATC
rpl2        ATA
ndhD        ATA
rpl2_copy2  ATA

6. Genes with nonstandard stop codon:

Genes       Stop codon
None        None

7. Conclusion

This genome has 129 genes (110 unique genes).
Including:
84 protein-coding genes (78 are unique)
37 tRNA genes (28 are unique)
8 rRNA genes (4 are unique)

Details are as follows:

84 protein-coding genes (78 are unique)

Genes        Length  Location
accD         1503    [60381:61883](+)
atpA         1524    [10330:11853](-)
atpB         1503    [55958:57460](-)
atpE         402     [55560:55961](-)
atpF         552     [13125:13269](-) [11939:12345](-)
atpH         246     [13720:13965](-)
atpI         744     [15188:15931](-)
ccsA         978     [117788:118765](+)
cemA         690     [64766:65455](+)
clpP         606     [75824:75894](-) [74728:75018](-) [73811:74054](-)
infA         249     [83720:83968](-)
matK         1518    [2528:4045](+)
ndhA         1092    [124985:125537](-) [123347:123885](-)
ndhB         1533    [100388:101162](-) [98926:99683](-)
ndhB_copy2   1533    [145418:146192](+) [146897:147654](+)
ndhC         363     [53323:53685](-)
ndhD         1482    [119009:120490](-)
ndhE         306     [121271:121576](-)
ndhF         2238    [113612:115849](-)
ndhG         531     [121810:122340](-)
ndhH         1182    [125539:126720](-)
ndhI         507     [122762:123268](-)
ndhJ         477     [52006:52482](-)
ndhK         678     [52592:53269](-)
petA         924     [65704:66627](+)
petB         696     [79567:80262](+)
petG         114     [69698:69811](+)
petL         96      [69421:69516](+)
petN         90      [29959:30048](+)
psaA         2253    [42223:44475](-)
psaB         2205    [39993:42197](-)
psaC         246     [120668:120913](-)
psaI         111     [62615:62725](+)
psaJ         129     [70723:70851](+)
psbA         1062    [263:1324](-)
psbB         1527    [76354:77880](+)
psbC         1422    [36352:37773](+)
psbD         1062    [35343:36404](+)
psbE         252     [68307:68558](-)
psbF         120     [68178:68297](-)
psbH         222     [78472:78693](+)
psbI         111     [7827:7937](+)
psbJ         123     [67784:67906](-)
psbK         186     [7226:7411](+)
psbL         117     [68039:68155](-)
psbM         105     [30932:31036](-)
psbN         132     [78228:78359](-)
psbT         102     [78063:78164](+)
psbZ         189     [38549:38737](+)
rbcL         1428    [58219:59646](+)
rpl14        369     [84653:85021](-)
rpl16        411     [85158:85568](-)
rpl2         834     [89461:89863](-) [88374:88804](-)
rpl20        384     [72352:72735](-)
rpl22        360     [87505:87864](-)
rpl23        282     [89870:90151](-)
rpl23_copy2  282     [156429:156710](+)
rpl2_copy2   834     [156717:157119](+) [157777:158207](+)
rpl33        201     [71310:71510](+)
rpl36        114     [83484:83597](-)
rpoA         1020    [81855:82874](-)
rpoB         3213    [24229:27441](-)
rpoC1        2049    [23771:24202](-) [21381:22997](-)
rpoC2        4149    [17088:21236](-)
rps11        417     [82951:83367](-)
rps12        372     [73555:73668](-) [102573:102804](-) [102011:102036](-)
rps14        303     [39551:39853](-)
rps15        273     [126844:127116](-)
rps16        267     [6244:6284](-) [5177:5402](-)
rps18        324     [71771:72094](+)
rps19        210     [88016:88225](-)
rps2         705     [16159:16863](-)
rps3         525     [86794:87318](-)
rps4         606     [48553:49158](-)
rps7         468     [101485:101952](-)
rps7_copy2   468     [144628:145095](+)
rps8         399     [84075:84473](-)
ycf1         4626    [128551:133176](-)
ycf15        201     [97335:97535](+)
ycf15_copy2  201     [149045:149245](-)
ycf2         5790    [91425:97214](+)
ycf2_copy2   5790    [149366:155155](-)
ycf3         501     [47127:47252](-) [46061:46288](-) [45157:45303](-)
ycf4         555     [63189:63743](+)

37 tRNA genes (28 are unique)

Genes           Location
trnH-GUG        [4:77](-)
trnK-UUU        [1608:1644](+) [4322:4356](+)
trnQ-UUG        [6802:6873](-)
trnS-GCU        [8115:8202](-)
trnS-CGA        [9129:9159](+) [9910:9969](+)
trnR-UCU        [10136:10207](+)
trnC-GCA        [28623:28694](+)
trnD-GUC        [32245:32318](-)
trnY-GUA        [32760:32843](-)
trnE-UUC        [32903:32975](-)
trnT-GGU        [33745:33816](+)
trnS-UGA        [38074:38166](-)
trnG-GCC        [39063:39133](+)
trnM-CAU        [39323:39396](-)
trnS-GGA        [48180:48266](+)
trnT-UGU        [49533:49605](-)
trnS-UGA_copy2  [50281:50316](+) [50805:50853](+)
trnF-GAA        [51220:51292](+)
trnV-UAC        [55044:55080](-) [54414:54439](-)
trnM-CAU_copy2  [55261:55333](+)
trnW-CCA        [69957:70030](-)
trnP-UGG        [70212:70285](-)
trnI-CAU        [90317:90390](-)
trnL-CAA        [98257:98337](-)
trnV-GAC        [104689:104760](+)
trnI            [106763:106794](+) [107538:107577](+)
trnA-UGC        [107642:107678](+) [108482:108517](+)
trnR-ACG        [112252:112325](+)
trnN-GUU        [112957:113028](-)
trnL-UAG        [117596:117675](+)
trnN-GUU_copy2  [133553:133624](+)
trnR-ACG_copy2  [134256:134329](-)
trnA-UGC_copy2  [138901:138937](-) [138062:138097](-)
trnI_copy2      [139785:139816](-) [139002:139041](-)
trnV-GAC_copy2  [141819:141890](-)
trnL-CAA_copy2  [148243:148323](+)
trnI-CAU_copy2  [156190:156263](+)

8 rRNA genes (4 are unique)

Genes           Location
rrn16S          [104988:106478](+)
rrn23S          [108671:111475](+)
rrn4.5S         [111580:111682](+)
rrn5S           [111894:112014](+)
rrn5S_copy2     [134565:134685](-)
rrn4.5S_copy2   [134897:134999](-)
rrn23S_copy2    [135104:137908](-)
rrn16S_copy2    [140101:141591](-)

This genome has 19 genes with intron(s). Details are as follows:

16 gene(s) have 1 intron(s)

Genes           Exons
rps16           [6244:6284](-) [5177:5402](-)
atpF            [13125:13269](-) [11939:12345](-)
rpoC1           [23771:24202](-) [21381:22997](-)
rpl2            [89461:89863](-) [88374:88804](-)
ndhB            [100388:101162](-) [98926:99683](-)
ndhA            [124985:125537](-) [123347:123885](-)
ndhB_copy2      [145418:146192](+) [146897:147654](+)
rpl2_copy2      [156717:157119](+) [157777:158207](+)
trnK-UUU        [1608:1644](+) [4322:4356](+)
trnS-CGA        [9129:9159](+) [9910:9969](+)
trnS-UGA_copy2  [50281:50316](+) [50805:50853](+)
trnV-UAC        [55044:55080](-) [54414:54439](-)
trnI            [106763:106794](+) [107538:107577](+)
trnA-UGC        [107642:107678](+) [108482:108517](+)
trnA-UGC_copy2  [138901:138937](-) [138062:138097](-)
trnI_copy2      [139785:139816](-) [139002:139041](-)

3 gene(s) have 2 intron(s)

Genes           Exons
rps12           [73555:73668](-) [102573:102804](-) [102011:102036](-)
ycf3            [47127:47252](-) [46061:46288](-) [45157:45303](-)
clpP            [75824:75894](-) [74728:75018](-) [73811:74054](-)
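
Reading the Location column: each entry is one or more segments of the form [start:end](strand), and for the rows checked here the Length value equals the sum of the segment lengths when the coordinates are read as 1-based and inclusive (for example atpF: 145 + 407 = 552). The sketch below parses a Location string and recomputes the length. The inclusive-coordinate reading is an assumption inferred from the rows of this report, not a documented property of the tool that produced it, and the names parse_location and total_length are hypothetical helpers introduced for illustration.

```python
import re

# Matches one segment such as "[13125:13269](-)".
SEGMENT = re.compile(r"\[(\d+):(\d+)\]\(([+-])\)")


def parse_location(location):
    """Return a list of (start, end, strand) tuples from a Location string."""
    return [(int(start), int(end), strand)
            for start, end, strand in SEGMENT.findall(location)]


def total_length(location):
    """Sum the segment lengths, ASSUMING 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end, _ in parse_location(location))


if __name__ == "__main__":
    # atpF row from the protein-coding table above (reported Length: 552).
    print(total_length("[13125:13269](-) [11939:12345](-)"))  # 552
```

Comparing total_length against the reported Length for every row is a quick way to catch rows that were damaged when the tables were copied or reflowed.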