Monarch geneset OGS2.0

DPOGS205681
TranscriptDPOGS205681-TA3147 bp
ProteinDPOGS205681-PA1048 aa
Genomic positionDPSCF300023 + 1046033-1057115
RNAseq coverage49x (Rank: top 70%)
Annotation
HeliconiusHMEL0073492e-10167.92% 
BombyxBGIBMGA001028-TA3e-15049.02% 
DrosophilaCG11436-PA7e-3128.89% 
EBI UniRef50UniRef50_D2A0N13e-9430.66%Putative uncharacterized protein GLEAN_08239 n=2 Tax=Tribolium castaneum RepID=D2A0N1_TRICA
NCBI RefSeqXP_002411375.13e-6135.18%conserved hypothetical protein [Ixodes scapularis]
NCBI nr blastpgi|2700060861e-9330.66%hypothetical protein TcasGA2_TC008239 [Tribolium castaneum]
NCBI nr blastxgi|2700060861e-9430.10%hypothetical protein TcasGA2_TC008239 [Tribolium castaneum]
Group
Gene OntologyGO:00054883e-15binding
KEGG pathway 
InterPro domain[205-313] IPR0119903e-15Tetratricopeptide-like helical
Orthology groupMCL16726 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205681-TA
ATGATGGCTGCAAGATTTATTTCATTCTTTTGTGTATTTATGATGTATTTTATCGGAACTCAAGCTTCTAATCACTGGATGGTCACTGAAAGTGGTTTAATTCAACCTAGGATTGACTCGCCTTTCAACTTGGATCGCCCTTACGATCTCTTAGCATTTTTAAATCAGGAAAAAAGATGGGATACTGTTTTTGAAATTTATAATGATCTATCAAGTAGACAAGCAATTATTGATACTCTTTGGGCTGATGTTGAAAAAGACACTAATATGGGAGTTAAAATAGCGCATAATGAACATTGTGTTAAAGCTGGTAATGTGAACGTTATAGATTGGTATGCGGCTCTGTTAGAAGATGGATCAAAGAAGATCTCTGGCAAAGAATTTCTTCTGCCTATTCCTTATTATGGCCCTAATACTGACATGCCAGATTGCAAACGAATCTCTTCATTGGCATTTAGTATGTTTGCATTTGAACATTTAGAAGGTATGTTACAACGAGACAATCTCACTGTGAATCCTGAATTTGTCCTGCCAGAGCTGATATCTCCCATCATGACCCTAGATCAATTTGGTCACTGGTTGACAACTATGCTTCAACGCAATAACTCCTCCTGGCTGTATTATCATATGGCCTCACTATATTGGCGGATTAGGGGAAACGCACCTAAAGCTATTGAGTGTAGCCGACGTGCTTTGCACTATGTACCGAGGGTGTATAAAGATATTGCTCTTGGAAGTCTTGGAATGATTTTACACAGAAGCAGCAAAACCAACGATGCTATAGTGGTTTTAAATGCAGCTATCGATCATGATCCAAATAACTATGTTAGTCATTTTGCTATTGCAAATGCCTATACAGTCATTGGAGATTTCAATACTTCTATAAAGTATTATGATAAAACACTTAAGTTAAATCCGAGGATGGAATTGGCTGCGAAACATAAGTCTGGAACGCTGTGCCATGCGAAATTAGGATTAAGAATTAAGTCAATTAGACAGACATTTAACAAGTTGCGTGAAGAATTAAAGGAATATACCAAGAAGGAAACGAAATATCTTAAAGTGCAGGCTGAGTTCCTTGGAACTATAAGACATCCTGATGATTTCGAATACAGAAACGTAGACAAAACATTCGAACGAATGGCTGAGATAACGGGTTTAAAAATGAAAGATATGAAAATGAAAATTGATAAAAATTCCCTTATAAAGTATTTTCTGGACGGTCTGATATATAACGACGACAAGTTAGCCAGAGCCGGCGTCGATGGCATCGATACGATATACAGCTTAGAGAGGCTTGTCATACACATCAATACAAATTCAAACGGTCAGAACGAACCGTTCCATGACCACCCCAGTTTCTCTATAAACATTAAGGAGAAAGTCAAACCTTCTGAACAAAAAACTGCGAAACCGGTTTCTCGAGATGACAAGAAAGAATACCATCTGGTCATTGAGAAGAAACCTACTGCGGATGAAGAATTGTCAGAATTTGAGACTGGAATTATTATGTATCCCCCAACTATAACAATAAATAGGAACATAGAGGACTTTGACAAGGAGATGGAGTGGCCATCGAATAAACTTTGCAAGGAATCAGCACATAAATTCCCAGAGAATGTTGAGGCTATATTCCCTGTCTTCTTGCCTTTCGAGAACAAAGGCATCAGACCTCAATTCACATATGAAGTGGTGGACACGGGGTTTTTGAGGCAAAAGCTATTGGAATATGTGAGCGATGGAAAGAGTGAGGACGCGGCACACATGCAGGACGCCGAAATAGGGCATAGGATATACATCGCTATGAAGAAGAAACTGGCGCCACGTTGGTTGATACTAACTCTCTCATCTTTGTACTGGAGAGTCCGAGGCCAGCCGTGGTCCGCTCTGAGTTGTCTTCGCGCCGCTGTTAAGGTGGCGAAGCCGCGCTATAAGGACCTGGTGCTGGTTTCTCTAGCCTCCGTACAGTTGGAAATAGGTCTGGCTGATGAAGCGATGACTAACGCCGAAGAGGCCTTCCGCATGAGCTTCTATGAGCCAGCTACAAATTTCCTGATTGCTGAACTGAGTATGCTCCGCAAACATCGGAACACACACATGTTCCACTTGAAGCAAGTGGTTAGAGTTGAACCAGGGTTCATGGGCGGACTTGCCAGGGAGCTTCTGCTGGCCTGGGCCTGTATACTGAAACAGGTCGTCGCTCTGAGAGAGATGGATTACGTGAAGGGCGCCATCTGTACTCAGGTTCAACCGCTCATGGACTTAGTCTGTCAAGAGGACGAAATCAACTGTAAATCGCCCAACATACAATGTTATACCAATCACGACACGAGCTCTCTCGTCCGCATGATGGACGAATCCGATACGGACTCCCTTTTCGCGATCAGTGAAAATTTCTTTGATCCGCTCATCGAGAACACTCCGGCCGATCGCGGTGAGAGATTAGCCCACCACGCCAACTTCGACAGCATGATAACAACCATTGAGTCTATATACTCCGGATGCGGGAACAAGAAATGTGCAAGTGAACTACCCACAGAAATATCATCAATACAAGTCAATAGTAAGAAGATCCCTGAATGTCGACTGCCGGCCGAGTTGGACGACTTCTACTTGGAGAAGATCGCGCGAGCTGATACAGAGGGTTGGAAACCCGTCATGACACTGATGCATCAGTTCTCGGAAATGTTTGACTCATATGACTTCAATACGTTGGGCTCGAAGATCGCGAAGTATGTGGATATGCGTCCTCGTTGGTGGGCGGGTCTGGTGGCGGCCGGCTGGTGGTGTGGTGCCGGTGGGCGCGGGTCGTGCGCCGCTCGCTGCCTCGCCGCCGCTCACAGATACGCTCCAAACAAATACGCTACTTACCCACTAAGATCCCTGGTCGCCATGTTACATATGCAATCAAAACAACAGGACGCCAAACAGATCGCCTACCTGTCTTTCTACATGTCACCCAAGAATAAAATAGAGGCTTTCCTTGTAGCCGTATCACACGCGTATCTGGCCGAGTACGAGCAAGCTATGTGGATGTATCGTTATGCTCTCACTTTCGACGCCGACTTCGTTCCGGCCAAAGCGAGCATACATTCAACGATATGCCTCCTTTTATATCGTGACGGGAAAGCACAATTTATGGAATAA

Protein sequence:

>DPOGS205681-PA
MMAARFISFFCVFMMYFIGTQASNHWMVTESGLIQPRIDSPFNLDRPYDLLAFLNQEKRWDTVFEIYNDLSSRQAIIDTLWADVEKDTNMGVKIAHNEHCVKAGNVNVIDWYAALLEDGSKKISGKEFLLPIPYYGPNTDMPDCKRISSLAFSMFAFEHLEGMLQRDNLTVNPEFVLPELISPIMTLDQFGHWLTTMLQRNNSSWLYYHMASLYWRIRGNAPKAIECSRRALHYVPRVYKDIALGSLGMILHRSSKTNDAIVVLNAAIDHDPNNYVSHFAIANAYTVIGDFNTSIKYYDKTLKLNPRMELAAKHKSGTLCHAKLGLRIKSIRQTFNKLREELKEYTKKETKYLKVQAEFLGTIRHPDDFEYRNVDKTFERMAEITGLKMKDMKMKIDKNSLIKYFLDGLIYNDDKLARAGVDGIDTIYSLERLVIHINTNSNGQNEPFHDHPSFSINIKEKVKPSEQKTAKPVSRDDKKEYHLVIEKKPTADEELSEFETGIIMYPPTITINRNIEDFDKEMEWPSNKLCKESAHKFPENVEAIFPVFLPFENKGIRPQFTYEVVDTGFLRQKLLEYVSDGKSEDAAHMQDAEIGHRIYIAMKKKLAPRWLILTLSSLYWRVRGQPWSALSCLRAAVKVAKPRYKDLVLVSLASVQLEIGLADEAMTNAEEAFRMSFYEPATNFLIAELSMLRKHRNTHMFHLKQVVRVEPGFMGGLARELLLAWACILKQVVALREMDYVKGAICTQVQPLMDLVCQEDEINCKSPNIQCYTNHDTSSLVRMMDESDTDSLFAISENFFDPLIENTPADRGERLAHHANFDSMITTIESIYSGCGNKKCASELPTEISSIQVNSKKIPECRLPAELDDFYLEKIARADTEGWKPVMTLMHQFSEMFDSYDFNTLGSKIAKYVDMRPRWWAGLVAAGWWCGAGGRGSCAARCLAAAHRYAPNKYATYPLRSLVAMLHMQSKQQDAKQIAYLSFYMSPKNKIEAFLVAVSHAYLAEYEQAMWMYRYALTFDADFVPAKASIHSTICLLLYRDGKAQFME-