Monarch geneset OGS2.0

DPOGS212314
TranscriptDPOGS212314-TA2238 bp
ProteinDPOGS212314-PA745 aa
Genomic positionDPSCF300019 - 1153625-1160809
RNAseq coverage669x (Rank: top 19%)
Annotation
HeliconiusHMEL0133895e-5358.65% 
BombyxBGIBMGA012037-TA2e-13638.03% 
Drosophila% 
EBI UniRef50UniRef50_E3WSZ79e-3225.91%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WSZ7_ANODA
NCBI RefSeqXP_001648329.12e-2325.93%hypothetical protein AaeL_AAEL014265 [Aedes aegypti]
NCBI nr blastpgi|3123814443e-3125.91%hypothetical protein AND_06245 [Anopheles darlingi]
NCBI nr blastxgi|3123814446e-3421.78%hypothetical protein AND_06245 [Anopheles darlingi]
Group
KEGG pathway 
Orthology groupMCL24802 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212314-TA
ATGGAACAAATATTTAAAAGAGTTCGGCATGTTGATCCAAATAACAGTGATGCTGTGAAAGAAGTACTGTCAAACTTTTTCCAAAATTTGCCGAAGCGCAATGATATTAACAATAAAAAGTTTTTGGACGAGTTATTGATTATAATTAACCGGTATCCTAAATATTGTATGTCATATCGTAATATTATTGAGTCATTTATAAGTAATTACTTAAGCTCCAACAATTACTACAATGTTATTAAAGCAGCTAAGTGTGCACATGCACTACAACAGGTTCGTCCTATTCAAGATAAGACAGCAACGCCGAAATCTTGTTGGCGGCAGCAGATGAACTCATTGTGTAATGCTGCACACTCACTCATAGAAGTGATATTTGCTGATGCTGTCGACATTTACAGAAGTAATGCTAAGCCTACAGAGAGTCATTCAAATTCTCCGTTATCAACTATACTGGCAAATATCGTTAAGAAGTCCCAAGACAAAGACAGATCTCAGTTGATGATGACAAGACTCAAGAACGTATTCACATTCATACAGGCTATGTTGGTAGAAATATATCCTGTACCGAAACCAATCCAGCCTCGTTCAATACTAGATGTGATAGTGAGGTCTCTGAGTGTGAGCAGTCAACACGTCTCTCTGGACGTAGCCTCTGTTAAAGTGCAGGCCTTGAAGACGTTGGATGCTATGATTCTGTGTCTGGGATCAAACCTGATACCATATTCACCTCTAGTCTTTAGACTTGCGACGCAGACTTTGAGATGGACCTCGGACAACATGAGCAGAACTACCGGCAAAGTTCGTTGCACAGCGTACAGTACGCTGAGCAAGTGGCTTCTAACACTGCACATTTATAAGATGCCAGAAAAAAATACCTGGGAAGATGATCTGACGGCGCATGTCGTGAGGGACGTGACCCCGGCGAAGAGAGTTGTGGCGCTTACGATGGGGCCGCAGCCGACTAAAAATTTAAGCAAAAAAGCCAAAAGGAAATTAGCTAATTCTCAACTGTTGCAAAGCTCAATCGCCGCTCACATGCCCGGGGAGAAAAATAAGATTGATATTCCAGAGGAAGTGAACAATGAGGTCACGGTATCAGCCTTGCAGTTCGCAGAAGTCTTCTTCACTGTTTGTGGAAGATTCCTCAAACCGGCCACACATAAGTTATTCCAAGAGCGTGTCATCCGCCATCACTCAGCAGGTGAGACGTTGCTGTACCTGCGAGTGCTGGAGGCGAGTCGCAAGACGACGCCGGCGACCGTGGCGCCACCGACACAGTACTGTCTTCATATATACAGTACGCTTGTGAACAGCTCCGACGCCGAGATATCAAAATTCTGTAGCCAAGCTCTACTGGATATAAGACTGCACCTATATTGCTCGCCGCCGTCCATCAACCTGGCTATAGAAATACCTCAAGATGAAGAGGAAACGGCGAATAAAAGGAAGAAGGTCTCGTCCAAAAACAGGGCCATGTTAGAGTATCTATTAGGGCCGGATAAAGTGCCCCGAGATAAAGAAGACGATATTATAACGATTCCAGACGAACCGTCGAATAAGAAACAACGTGTCGACGAATTGGATAGAATAAGTCTAAGCAGCGATTCCACCAGCACTGTTAAGATACCGTACGGAACAGACATCAGCTCAGACTCGGACGGAGATAACGTCATGGAGGTCGACGTAGTCGTCGAAATGAATCACACCACAGGCAGAGAGAGAGTTCTAGAAGCCGCCGACGTTCCGAAGATATCCATAAACGACGAAAAGCAATCATCCGATCAAATAAGTCTAAATGATATAACAAACGATGAGAGCAACCAGGCAGACGCGATTACTAGTGAAGACGTCTCTGATGTTATACATGAAGCGCCTACACAACTGAACACATCAAGTGGCGCCCCCCAAGTAGTGTACGACCATCCGGATACAGGAACCGGCGACGTCACAGTCCTGGAGAGGATTGACGACGAAAATATACCAAACACGAACGATACGGACGAAGATGCGATAACTTGCGGGCAAATCGTACGAAGCTCGCAAGAAATTGTCAACGGAAATAATGAGCCGGAAGTCAATGGTGTTGATAAAATTAATGAGGACAGCGATGTGTATAATATTACCACGAAAGATATAAATAAGGGGGAAGATAATCTTGCTGCCAAAATAGATGGAACCAGTGTGGAAGATATGATGGCGGATTTCGTTGATGAAGTAAATGAAGCTGTAGCTGTGTAA

Protein sequence:

>DPOGS212314-PA
MEQIFKRVRHVDPNNSDAVKEVLSNFFQNLPKRNDINNKKFLDELLIIINRYPKYCMSYRNIIESFISNYLSSNNYYNVIKAAKCAHALQQVRPIQDKTATPKSCWRQQMNSLCNAAHSLIEVIFADAVDIYRSNAKPTESHSNSPLSTILANIVKKSQDKDRSQLMMTRLKNVFTFIQAMLVEIYPVPKPIQPRSILDVIVRSLSVSSQHVSLDVASVKVQALKTLDAMILCLGSNLIPYSPLVFRLATQTLRWTSDNMSRTTGKVRCTAYSTLSKWLLTLHIYKMPEKNTWEDDLTAHVVRDVTPAKRVVALTMGPQPTKNLSKKAKRKLANSQLLQSSIAAHMPGEKNKIDIPEEVNNEVTVSALQFAEVFFTVCGRFLKPATHKLFQERVIRHHSAGETLLYLRVLEASRKTTPATVAPPTQYCLHIYSTLVNSSDAEISKFCSQALLDIRLHLYCSPPSINLAIEIPQDEEETANKRKKVSSKNRAMLEYLLGPDKVPRDKEDDIITIPDEPSNKKQRVDELDRISLSSDSTSTVKIPYGTDISSDSDGDNVMEVDVVVEMNHTTGRERVLEAADVPKISINDEKQSSDQISLNDITNDESNQADAITSEDVSDVIHEAPTQLNTSSGAPQVVYDHPDTGTGDVTVLERIDDENIPNTNDTDEDAITCGQIVRSSQEIVNGNNEPEVNGVDKINEDSDVYNITTKDINKGEDNLAAKIDGTSVEDMMADFVDEVNEAVAV-