Monarch geneset OGS2.0

DPOGS210040
TranscriptDPOGS210040-TA1263 bp
ProteinDPOGS210040-PA420 aa
Genomic positionDPSCF300017 - 1400938-1424947
RNAseq coverage174x (Rank: top 50%)
Annotation
HeliconiusHMEL0064675e-8190.51% 
Bombyx% 
DrosophilaCG18769-PC4e-8248.15% 
EBI UniRef50UniRef50_UPI00017924E62e-10165.04%UPI00017924E6 related cluster n=1 Tax=unknown RepID=UPI00017924E6
NCBI RefSeqXP_316356.38e-10565.44%AGAP006330-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582956872e-10365.44%AGAP006330-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|910820593e-10267.27%PREDICTED: similar to LOC100037005 protein [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[162-369] IPR0067698.5e-55Coiled-coil domain containing protein 109, C-terminal
Orthology groupMCL14891 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210040-TA
ATGAAGTTGGAAGATGTGGCGAAGTACGATAATCAATTTTCTTTAATTTCTATTCCTCCTAATATTTATTTATCAAAAGAAGTATTAGTGACATACCGGCGTGGCCTGCCTGTGATAACAGTGCCTCTGCCGTCAAGACGGGAACGTTGTCGGTTTACATTAAGACCAGTGTCGCAAACCGTCGGAGATCTCCTCGAACAGGTGAAAGCGGAAGACCGTGGTGTTGAGCGCGCGGTTGCTTTGGCCGCTGATGAACGTGTGAGAATTGCAGCCAGTGACACCGTAGAGTCACTGCTGGAAAACGACTTCAGGCTCTTGATCAATGACACTGAATACTACGTCAAGAGTCCGCCACAAGTAAAAGTATTAGTGACATACCGGCGTGGCCTGCCTGTGATAACAGTGCCTCTGCCGTCAAGACGGGAACGTTGTCGGTTTACATTAAGACCAGTGTCGCAAACCGTCGGAGATCTCCTCGAACAGGTGAAAGCGGAAGACCGTGGTGTTGAGCGCGCGGTTGCTTTGGCCGCTGATGAACGTGTGAGAATTGCAGCCAGTGACACCGTAGAGTCACTGCTGGAAAACGACTTCCGGCTATTGATCAATGACACTGAATACTACGTCAAGAGCCCGCCACAAGAACGTCTAAGCACCGAAGAGATAACTCGTCTAAGCGATGTTCGTAATCTCGTTAACCAGCTGTACGAGGCTCTGAACGTCCGTGAGCACCAGATCAGAAAAGAACGTGAACTGAGGAGCCAGCTGGAAAAACTAACAGCCGAGCTGCAGCCTTTAGAAGAGAAACGCATGACGTTAGAGCATGAGACGGCTCGTTCGACGTCAGCCCTCACTTGGGTGGGTTTGGGTCTTATGGGGGTTCAGTTCGGGGTTCTGGCTCGTCTAACCTGGTGGGAATACTCCTGGGACATCATGGAGCCGGTCACGTACTTCGTGACTTACGGAACGGCCATGGCGGCGTACGCCTACTTCGTGCTGACGAAACAGGAGTACATTCTACCTGATGTCAAGGATAGACAGCATCTGATCACTTTGCACAAGAAGGCGAAAAAAATTGGTCTCGACATAAATCAGTACAACCATCTTAAAGATGAGGTTGACAAATTACAAAAGGATCTAGCTCGTCTGCGGGACCCCCTTCAGATACATCTGCCAGTGAACAGAATGGACGAAGCGAAGCGCTCCCCGCTCACCAAGATCAAAGACATGCTCGAAGAGACGACCAAAAAAATGAAAATAACGTAA

Protein sequence:

>DPOGS210040-PA
MKLEDVAKYDNQFSLISIPPNIYLSKEVLVTYRRGLPVITVPLPSRRERCRFTLRPVSQTVGDLLEQVKAEDRGVERAVALAADERVRIAASDTVESLLENDFRLLINDTEYYVKSPPQVKVLVTYRRGLPVITVPLPSRRERCRFTLRPVSQTVGDLLEQVKAEDRGVERAVALAADERVRIAASDTVESLLENDFRLLINDTEYYVKSPPQERLSTEEITRLSDVRNLVNQLYEALNVREHQIRKERELRSQLEKLTAELQPLEEKRMTLEHETARSTSALTWVGLGLMGVQFGVLARLTWWEYSWDIMEPVTYFVTYGTAMAAYAYFVLTKQEYILPDVKDRQHLITLHKKAKKIGLDINQYNHLKDEVDKLQKDLARLRDPLQIHLPVNRMDEAKRSPLTKIKDMLEETTKKMKIT-