Monarch geneset OGS2.0

DPOGS215135
Transcript: DPOGS215135-TA, 1299 bp
Protein: DPOGS215135-PA, 432 aa
Genomic position: DPSCF300427 - 9670-13876
RNAseq coverage: 25x (Rank: top 77%)
Annotation
Heliconius: HMEL022552  5e-159  68.09%
Bombyx: BGIBMGA001678-TA  9e-141  65.23%
Drosophila: nvd-PA  3e-75  38.23%
EBI UniRef50: UniRef50_Q1JUZ2  6e-158  62.84%  Rieske-domain protein Neverland n=3 Tax=Obtectomera RepID=Q1JUZ2_BOMMO
NCBI RefSeq: NP_001037626.1  1e-158  62.84%  neverland [Bombyx mori]
NCBI nr blastp: gi|301072746  3e-158  61.84%  Rieske-domain protein neverland [Spodoptera littoralis]
NCBI nr blastx: gi|301072746  5e-159  61.84%  Rieske-domain protein neverland [Spodoptera littoralis]
Group
Gene Ontology:
    GO:0051537  1.4e-29  2 iron, 2 sulfur cluster binding
    GO:0055114  1.4e-29  oxidation-reduction process
    GO:0016491  1.4e-29  oxidoreductase activity
KEGG pathway: rha:RHA1_ro02490  2e-26
    K00517 (E1.14.-.-) maps to:
        Naphthalene and anthracene degradation
        Stilbenoid, diarylheptanoid and gingerol biosynthesis
        Limonene and pinene degradation
        gamma-Hexachlorocyclohexane degradation
InterPro domain: [77-215]  IPR017941  1.4e-29  Rieske [2Fe-2S] iron-sulphur domain
Orthology group: MCL16272  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215135-TA
ATGGCGGCCCATGACGCATTGACGTTTGACGCGAACCCGTGTCACGACCCGTTTTACGTTTTGAACGTTCTGTTTCACGCAGTTGCAGATTACGGCTTCAGATATTTCAAATTAGGGATATTTCTCTCGATACTTATAATAATATGTTTAATTATATACAAATCTTACTGCTCTCCGGTTATATACAAGAAGGAATTATCTGAAGTAGGCTACGAACACCTCTCCAAGGGGTCAGATAGACCCCTGCACATCCTGAGAGCTCAGAATACAAGACGTTTGGGGGATAAACTGCCCCCGCCCTACCCTAATGGATGGTTCGCTCTGGTTGAGAGTCGGGACTTGAAGGTCGGGTCAGTAATACCCGTGGATGCTATGGGTCTTAACTTCTGCGTATACCGAGGGGAGGACGGTGTTGCCAGGATAGTGGACGCCTATTGTCCCCACTTGGGTGCGAATTTAGCCGTGGGTGGAACCGTCTGTGGTAATTGCATAGAGTGTCCTTTCCACCAGTGGAGGTTTGGGGAGAATGGAGATTGCGTGAGCATACCTAACGTTGAGGCGGTACCAAAAGGCATATCCATCAAGACCCACCACGCCATGGAAATTGATGGGGCGGTGTGGGTGTGGTATGATGTCGAAGGTCGGGAGCCTCTATGGACGGTGGACAGAATCCCGGAACTAGACACGTGGGGATACAGGGGACGGAACGAGTTCATAGTTAACGCTCACTTGCAGGAAATACCCGAGAACGGTGCGGATGTTGCTCACCTGAACGCGGTGCACACAGTCTCCATGCTGAGTGACGTCGGGTTTAAATATCCATTCCTCAATCATTTTATTGGCTACCACACTTGGAACGCTGAATGGTTGAAGGGTGACGACCACACCGCCTCTATGAAAATAACTCAAAAATACCTCATCATGAAATTAGACATCTTCCCGATAGATGTCACTGTGACACAGATAGGTCCAGCGCACGTTCGTCTTATGTTCACCTCTCCTCTGGGCCCCATGGTTGTTCTTCAGTCAGTGACGCCGCTCGGACCTCTGTTGCAGCGCGTGATACATCGCGTGTACACCCCCACGTTGAACGCGCCGCTGGGCGCTGCACTAGTCGTTTTGGAAGCCTACCAGTTCCAACGCGACGTTGCGATATGGAACAGCAAGAGATACGTCAATTCACCTACTTACGTCAAATCGGACAAAACGATACGTGCTTTCAGAACATGGTTCTCTCAGTTCTACAGCAAGAACAGTATACCACTGAGAGACGCTATGCAGAACCCATTGGACTGGTAG

Protein sequence:

>DPOGS215135-PA
MAAHDALTFDANPCHDPFYVLNVLFHAVADYGFRYFKLGIFLSILIIICLIIYKSYCSPVIYKKELSEVGYEHLSKGSDRPLHILRAQNTRRLGDKLPPPYPNGWFALVESRDLKVGSVIPVDAMGLNFCVYRGEDGVARIVDAYCPHLGANLAVGGTVCGNCIECPFHQWRFGENGDCVSIPNVEAVPKGISIKTHHAMEIDGAVWVWYDVEGREPLWTVDRIPELDTWGYRGRNEFIVNAHLQEIPENGADVAHLNAVHTVSMLSDVGFKYPFLNHFIGYHTWNAEWLKGDDHTASMKITQKYLIMKLDIFPIDVTVTQIGPAHVRLMFTSPLGPMVVLQSVTPLGPLLQRVIHRVYTPTLNAPLGAALVVLEAYQFQRDVAIWNSKRYVNSPTYVKSDKTIRAFRTWFSQFYSKNSIPLRDAMQNPLDW-
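
Consistency check (illustrative):

The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon. The translate helper and its stop_symbol parameter are hypothetical names chosen for this example and are not part of any geneset tooling. Only the first ten and last five codons of DPOGS215135-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence codon by codon.

    Stop codons are written as stop_symbol (assumed here to be "-",
    matching the last character of the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 15 nt of DPOGS215135-TA, copied from the record.
head = "ATGGCGGCCCATGACGCATTGACGTTTGAC"
tail = "CCATTGGACTGGTAG"

print(translate(head))  # MAAHDALTFD
print(translate(tail))  # PLDW-

# Length check: 432 residues plus one stop codon, three bases each.
print(3 * (432 + 1))    # 1299
```

Both translated fragments match the corresponding ends of DPOGS215135-PA, and the 1299 bp transcript length equals 432 codons plus one stop codon. To check the full record, pass the complete nucleotide sequence to translate and compare the result with the protein sequence.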