Monarch geneset OGS2.0

DPOGS204475
Transcript: DPOGS204475-TA, 1809 bp
Protein: DPOGS204475-PA, 602 aa
Genomic position: DPSCF300002 + 661174-666986
RNAseq coverage: 850x (Rank: top 15%)
Annotation
Heliconius: HMEL006268; 0.0; 75.05%
Bombyx: BGIBMGA007816-TA; 0.0; 66.44%
Drosophila: raw-PC; 1e-45; 54.84%
EBI UniRef50: UniRef50_B4M9N2; 7e-44; 54.84%; GJ17880 n=23 Tax=virilis group RepID=B4M9N2_DROVI
NCBI RefSeq: XP_002002497.1; 2e-45; 56.13%; GI12397 [Drosophila mojavensis]
NCBI nr blastp: gi|195115906; 5e-44; 56.13%; GI12397 [Drosophila mojavensis]
NCBI nr blastx: gi|195156571; 3e-57; 29.42%; GL26218 [Drosophila persimilis]
Group
KEGG pathway 
Orthology group: MCL15083, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204475-TA
ATGGAGGTCCTGCGGCTATTAAAAATTATATTGTTTGTTGGAAATATAGATGATGTCAGTGAAGAGGTCGTTGACCTCACGTGCAGCGAGGCGTCGCCACCTCCTTGCGAGGTGTTCCTCGGGGGATCCTGCAACCCCACAACTTGGCGGTCAGACATCGCTATACCGATGCTCAAGAAGATGGGCATCACGTATTTCAATCCACAAGTAGAGGACTGGTCGACGGAGCTGATAGAGGTAGAACATCGTGCGAAGGCGGAGGCGAGAGCGTTACTGTTCGTGTTGGACAGCGAGACGAGGGCTGTGGCGGCCAGCGTAGAGGCAGCACACCTCGCTGCCGCGCCCAGAGACCTCCTGCTGGTGTTGAGGCCCTACTCCAGACATCAGACTATAGGACGAGAGACTATATCGGATCTGGAATACGTAGAGCTGTCTCGAGCCCGAGCCACATTACAAGAGGCTGTAGAGCGGCGCGGTCTGCCGGCCTTCACAGATATTCCGGCGGCGCTGCGGTGCGCGGGCGCAGTGCTTCGCGGCGCCCGTACGCACCCCCGACACTCGTTGGGCCACACAATCCTACGGTTGAAACGCGCGTACGATGCAGCCGGCGGTAGAAACGCTCGTCTGCCGCGCTCTAGAGCAGTCGACGCTCTTAAGGAGGCGACTCGAACGCCGCGTGACGTCGCCGAGCGCTGCTTACCGCCTAATGCTACCACTGTGGACTTTGAAACGTTCTGTGCCGCTGTTGCCGAATTAGCCTCTGATAATGGACCCCATACTCCGCGCTCGCCGTCCGTGTCGAGCGTCGCCTCGCGTGTCCGCCGCGCGTTCCGCTCGCTCCGTGATTTACTAACGTCATCTACTTTGACGTCTGATTACCGTAACGGCCAAGAGGAGCGTTCAGATCGCCCTGAAGCGAGCAGCTCGGAGAGGTCCCGCGAGAATGGACTTAGGGTTCACAACGCCCGGCTGAGGGCCTACGGGCAGAAACTATCTCTATATATACCTAAGAACGTCAGTCGTACGGAGGAGGGGTCAGCTCCCAAGTCTGGTGATTCAGTGTTAACGCCGGGCACGGAGCGCCGCCTCGAGCGTCTGCCCTCCATGGTGGCGCACACGCACGACGTCTACCTAGGAGGAAACTTCCCGACAAACAGCACCAGACCGGAGGAGGTGCTCCGCCGCGAGGGTTACACGTACGTGGTCCCGCGAGCTAACGACTATACACGGATGTTCTCAGCACCCGCAAGGCGGACAGCGCCTACCACGGATTCGCCATGCAGGGACAAGAAACCGCGGCCCGACCACGACCACGACCACGATGACCACGGTGACCACGGTGACCACAACCCTGAGAGAACGGAACGACCCTCACCCACCGCTCTCTCACCGACGGATGTGGTGTTCAGGGAGAAGACTGCTGACGATAGAACAGCTGATCGCTTGAGCGCCTCAGACTTCTATAAAGTCTCGGAGGATATAACACCGCAGCCTTTCAAAGGTACATATGATGAGGAGCAGTTGGCTGGGTCTCGTGTGGCGTTGTTCAGCCTGAGCGCGGAGGCGCCGGGGTTTGCGGCCATGGTGTTAGCGGCGCACCACATGGGGCTGCGAGCCAATAACACTGTACTGCTAGTACAAACCATGGACCGAGATAGAGCGCCCAAGTACAGCGAAGCGGCCGTTAAGGACTACAACCGCGGCCGACACTACCTTATAGATCTGGCTCGCCGAGCCGCCGTGCCTGTGTTCGACAACCTGGACGCGGCCATGGCCTGCGTTATAGACAAACTGAGGACCACCAACTAA

Protein sequence:

>DPOGS204475-PA
MEVLRLLKIILFVGNIDDVSEEVVDLTCSEASPPPCEVFLGGSCNPTTWRSDIAIPMLKKMGITYFNPQVEDWSTELIEVEHRAKAEARALLFVLDSETRAVAASVEAAHLAAAPRDLLLVLRPYSRHQTIGRETISDLEYVELSRARATLQEAVERRGLPAFTDIPAALRCAGAVLRGARTHPRHSLGHTILRLKRAYDAAGGRNARLPRSRAVDALKEATRTPRDVAERCLPPNATTVDFETFCAAVAELASDNGPHTPRSPSVSSVASRVRRAFRSLRDLLTSSTLTSDYRNGQEERSDRPEASSSERSRENGLRVHNARLRAYGQKLSLYIPKNVSRTEEGSAPKSGDSVLTPGTERRLERLPSMVAHTHDVYLGGNFPTNSTRPEEVLRREGYTYVVPRANDYTRMFSAPARRTAPTTDSPCRDKKPRPDHDHDHDDHGDHGDHNPERTERPSPTALSPTDVVFREKTADDRTADRLSASDFYKVSEDITPQPFKGTYDEEQLAGSRVALFSLSAEAPGFAAMVLAAHHMGLRANNTVLLVQTMDRDRAPKYSEAAVKDYNRGRHYLIDLARRAAVPVFDNLDAAMACVIDKLRTTN-
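
Consistency check (sketch):

The lengths listed above can be cross-checked against each other: a coding sequence of 602 amino acids plus one stop codon should span 3 x 603 = 1809 bp, which matches the transcript length in this record. The following is a minimal sketch of that check, assuming the transcript consists only of the coding sequence (no UTRs) and that the trailing "-" in the protein sequence marks the stop codon. The function names are hypothetical and are not part of any MonarchBase tool.

```python
# Hypothetical helpers for sanity-checking a gene record; not a MonarchBase API.

STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def expected_cds_length(protein_aa: int, has_stop: bool = True) -> int:
    """Return the CDS length in bp implied by a protein length in residues."""
    return 3 * (protein_aa + (1 if has_stop else 0))


def has_start_and_stop(cds: str) -> bool:
    """Check that a CDS starts with ATG, ends with a stop codon,
    and has a length divisible by three."""
    cds = cds.upper()
    return (
        len(cds) % 3 == 0
        and cds.startswith("ATG")
        and cds[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # Values taken from the record above.
    transcript_bp = 1809
    protein_aa = 602
    print(expected_cds_length(protein_aa) == transcript_bp)
```

To apply `has_start_and_stop` to this entry, pass it the DPOGS204475-TA nucleotide sequence as a single string with the FASTA header line removed.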