Monarch geneset OGS2.0

DPOGS213810
Transcript: DPOGS213810-TA; 5463 bp
Protein: DPOGS213810-PA; 1820 aa
Genomic position: DPSCF300106 + 149794-170456
RNAseq coverage: 619x (Rank: top 21%)
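The lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the transcript length refers to the coding sequence including the stop codon, and that the genomic coordinates are 1-based and inclusive. The function names are illustrative only.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 5463
PROTEIN_AA = 1820
GENOMIC_START, GENOMIC_END = 149794, 170456


def codon_count(n_bp: int) -> int:
    """Number of whole codons in a coding sequence of n_bp bases."""
    if n_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return n_bp // 3


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


codons = codon_count(TRANSCRIPT_BP)
genomic_span = span_length(GENOMIC_START, GENOMIC_END)

# 1820 residues plus one stop codon should account for every codon.
print(codons, PROTEIN_AA + 1, genomic_span)  # 1821 1821 20663
```

Under these assumptions the transcript accounts for exactly 1820 residues plus a stop codon, and the genomic span (20663 bp) is much longer than the transcript, which would be consistent with intronic sequence between exons.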
Annotation
Heliconius: HMEL016156; 0.0; 77.35%
Bombyx: BGIBMGA006551-TA; 0.0; 68.60%
Drosophila: CG42672-PH; 0.0; 45.66%
EBI UniRef50: UniRef50_E2AU14; 0.0; 47.24%; Ankyrin repeat-rich membrane spanning protein n=4 Tax=Formicidae RepID=E2AU14_CAMFO
NCBI RefSeq: XP_001811729.1; 0.0; 51.41%; PREDICTED: similar to CG30387 CG30387-PB [Tribolium castaneum]
NCBI nr blastp: gi|383854925; 0.0; 47.58%; PREDICTED: kinase D-interacting substrate of 220 kDa-like [Megachile rotundata]
NCBI nr blastx: gi|340725021; 0.0; 47.07%; PREDICTED: kinase D-interacting substrate of 220 kDa-like [Bombus terrestris]
Group
Gene Ontology: GO:0005515; 4.8e-07; protein binding
KEGG pathway: tca:100141654; 0.0
    K12460 (KIDINS220, ARMS) maps-> Neurotrophin signaling pathway
InterPro domain: [200-587] IPR020683; 1.4e-81; Ankyrin repeat-containing domain
    [614-1150] IPR011646; 1.1e-80; KAP P-loop
    [277-306] IPR002110; 4.8e-07; Ankyrin repeat
Orthology group: MCL13649; Single-copy universal gene
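The bracketed InterPro ranges above can be read as residue intervals on the 1820 aa protein. The sketch below is illustrative only: it assumes the ranges are 1-based and inclusive, and the helper names are hypothetical. It computes each domain's length and checks how the three hits relate to one another.

```python
# (accession, start, end) copied from the InterPro rows above.
DOMAINS = [
    ("IPR020683", 200, 587),   # Ankyrin repeat-containing domain
    ("IPR011646", 614, 1150),  # KAP P-loop
    ("IPR002110", 277, 306),   # Ankyrin repeat
]
PROTEIN_AA = 1820


def length(start: int, end: int) -> int:
    """Residue count of an interval, assuming 1-based inclusive ends."""
    return end - start + 1


def contains(outer: tuple, inner: tuple) -> bool:
    """True if the inner interval lies entirely within the outer one."""
    return outer[1] <= inner[1] and inner[2] <= outer[2]


lengths = {acc: length(s, e) for acc, s, e in DOMAINS}
repeat_in_domain = contains(DOMAINS[0], DOMAINS[2])
all_within_protein = all(1 <= s <= e <= PROTEIN_AA for _, s, e in DOMAINS)

print(lengths)
print(repeat_in_domain, all_within_protein)
```

With these coordinates the single ankyrin repeat hit (IPR002110) falls inside the larger ankyrin repeat-containing domain, and the KAP P-loop region lies downstream of it without overlap.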
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213810-TA
ATGTCCCAATCACCCGAACTGCCTAAAACAACTAACGGCGAAAATGAATACGTTCACATACGCTGTCTCATCGCCAATATACACAAACCGAGAAATGTGTCGAGGCCCCCGGTGTCAGAGTCGCCACTGGAGGTAGTTAAAGAGTCACCGACGTCTTCTGATGCAGATCCCAGCTGTTCCGAGTCCGATATCTCCGAAAAGTCGGAGATATCAAATGCGAACGATACAAAAAACACTGAAGAAGCTCCTCCAAGACGAGCTAAGAGCAACAGTTTCGCTGGTGCTATGCTCGCTCCTCCAATGTTCAACAAGCGACGGAGGCCCTCCTTCCTTCATCTCGCCGTCGGTGGTGGGAACGAGCACTCCGGGCCTTTGTCAGCAGGGTCACACTTCAACGTGCCACGCTTCACTGTCACAGCACCACCTGGGGAGCATCGCCGTTTCAGTCATGGCTTCCCGTTCCATGGCTTTGCTCTTCGACGTCATTCCAATACGAGCCTTCACCGCACCGAGTCGATGGTGTCCCTGGCTTGTTTCAAGACGCTCTCACAGCTGGTCAACACTGATGGGAACGAGTTGCAGAACTTCCTCTCGTCTAACAGAAATATCAACGTCGACGACAAAGACGATAATGGTACCACCGCGCTCATGGTTGCGAGTGAGAGCGGGCGGTTGTCGGCGGTGCGGTTGTTATTGGGCGCGGGTTCGGACGCGTGTGCCGCGGATGGTGATGGATGGACCAGCCTCGCGTTCGCAGCCAGGGGAGGACACCTAGCTGTCGTACAGGAACTTATTGACGCCGGCGTCGTTATCGACAGCAGGGATTGTGGCGGATGGACGCCTCTCATGTGGGCTTCATACAAAGGTCACGAAGACATCGTCGTCTTGCTCTTGGAGAAAGGCGCCGACGTACACGCACATGGCAACTACAACATAAATTCCTTAGTATGGGCCGCTGGTCGACGTCATAGTGGTGTGGTGTCGCGCCTGTTGTCTGCGGGTGCTCGTCCAAATTCTTGCGACAAGTACGCCACTTCGGCCCTCACGTGGGCGTCCCGGGCTGGAGACACGGCTTCATGCGCCGCGCTCCTACGAGCTGGAGCCGATCCTAACACAGCTGGCATGTACTGCTGGACACCTTTGCTGCAAGCTACTCACGGTAACCACTTCGAAATAGTCCAAATGTTGCTGGAACACAAACCCAACGTGAACGCTTTAGATAAAGATGGTTGTACAGCGCTAGCTATAGCTTGCAAAGAGGGATATTATGATATCTCATTGGCCTTAATAAACTCTGGGGCGTATATTAACGTGCAGGATCACTCAAAGGACACGCCCCTGATATACGCTGTGAAGGGTGGATACAAGAACATAGTGGAGGCTTTATTGAAGAAGCATGTGGATGTAGACCTGCCAGGGAAGGAAAAGAAGACGCCGGTTTATACGGCAGTGGAGAAAGGTCACGTCGCCATATTGAAGCTTCTGTTGGCTTCAAATCCTGATTTGGAACATTGCACTACCAGCGGTGACACGGCCCTCCTGCGTGCTGTGCGGTCCCGGAACGCAGAAATGGTGGCCTTGTTACTGGAGAGACGAGCTCGTGTCGCAGCTGCTGATAACAGGGGAGACACCGCGTTGCACGTTGCTATGAGGGCGCGGTCTAAGCAAATTGTTGAAATCCTCTTGCGGAACCCAAAGAACAGCCAGCTCTTATACAAACCGAATAAGATGAACGAAACTCCATACAACATAGATATGAGTTACAACAAGACGATATTGGGACAAATATTCGGAGCTCGTAAGCTAAACACCAACGAGGACAATGAGAATATGCTCGGATACGAACTGTATAGCGCGGCGTTGGCTGACATGCTCTCAGAACCGAGTCTCTCGGTACCGATCACTGTGGGGCTTTACGCTAGATGGGGTTCAGGGAAATCATTTCTGCTTAATAAACTTAAAGAGGAGATGAAGAACTTCGCTCGTCAGTGGAGCGAGAC
CGGGTGGTCGTGGTCGTGGGCGGTGTGGTGGGGCGCGTGGCACGCCTCACTGGCGGTGGCGGCGTGTTCCTCTATGGCGGGCGCTCCCTCGCACGTGGCGCTCGCGCTGCTGTTCGTGCTTTTCGCTGCCATGTACCTCGGTTTCTACCTGCTGTGGTATTTAGGGAACAGATATGAGTGGTGGTGGGCGGGCGGTATGGTTGCTGCGTTAGGACGGCGATTCAGCTCTTTATTGTTGCTCCTACAAGTTGTTTTCTGTCACCCGCCAGGCCCAAATGACCCTCGTGCTTTACCCGCAACGCCTATTAGATTTCACTTTACAGAGGGTATGAAGAGTGGCCCGGGACAGGAGGGTGAGGCTATGGTGGTGCAAATGTTAGGAAGTCTCGCTGAGGCTCTGGAGTGTCAGTACGGCAGAGTCTGTACAAGACTGGCGAGAGCGTTCAGACCAAGGCCGTTGTCATCGACATCAGGGTGGAAATGGAGGAAGGCGTGTTGCATTCCACATATAATAACATTCGAGCTGAGCTTCATATGTGTGCTGCTCGGTGTCTGTGTGCTGGTGTTGTATCTCACCGACCCTGAGGAAGATCCAAGTCGTCGCGACGTCCGTCAGGGGGTGATGGTAGGGGCGTGCGCGGCTGCTGGTGCCGTGCTCCTCGCTAACCTGTACGCTGGAGCACGTGCTCTAGCCGCGCTTGCTCTGCCGCCGCGAGCACGTCTCGCACGTGCTCTGAAAAGGGATCACGCTCACACCGTCGCGTTAAGGCCGGAGGTTCAGGCACTCACTCACACCGTGTCGTGTTTGGACGCGTTCACTGGCCAGCAGACACGGCTAGTAGTTGTCGTGGACGCTCTGGACAGTTGTGAGCAGGAGAAGGTTCTGGCGCTGCTGAACGCTGTTCACGCGCTATGTTCTGACCCCCGGAGCCCCTTTATACTGCTCCTGGCAATCGACCCGCACATCATCAGCAAGGCTGTAGAAATAAACAGTCGTCGAGCGTTCTCTGAGAGTAACATCGGAGGTTGGGACTACCTTCGTAACATGGTACAACTGCCTTTCTACCTGCAAAACTCAGCGCTGAGGAGGGTGAAGGTCGCTCAGCAGACTGCCGCTAGACGGATGCAGGCGCTAGCCGTCGACGACTTCAGTACATCGCTACAGAGATCCGTATCAGCCCGTCGGTTGTCGTCTACGTCAGAGTTGATGTCCAGTCAAGAGCGGATCAAGGGTCGCGGGGAGGTGAGGGGTGAGGGTGGTCGCGGTCGCCTCCGTCCGTCAGAGTCCGTGGCGTCCTCGGTGGCGTCGGGTCTGCACCGCCCCGCGCCGGCCCCCGCGGGTGCCGCGGATCTCGGCCGGGTGCTCCTCACTGACGATTACTTCAGTGACGTCAACCCCAGGAGTATGAGGCGGCTCATGAACGTGCTCTACGTCACTGGTCGTCTCCTGAAAGCTTTCCAAATAGAGTTCAACTGGTACCAGCTCGCGTCGTGGGTGAACCTCACGGAACAATGGCCTTTCCGGACCTCCTGGATCATCTATCACCATGAAACATACGAAGAACACATCGAGGACTCCACTTCACTCAAACACATCTATGACAAAGTTAAGCCTTCAATGGGCGGTCTCCGTGAGGCCAGCACACTGATCGAGTTGGATCGCGACGAGCGTAAACTGGAAGTATTCCTGAGCTTCCACCGGTCCACACTCACCGCCGCTGATCTCAAGATATTTCTGCCGTTTACTATAAACTTAGATCCCTATATCAAGAAAGTCATTAAAGAGGAGCACGCGCAGGCGGGTGTCGAGGAAGACCTCGGAGCTAGCGGGGCCGCCTCCATGTACGCGAGCAACAGAACACAGCAGGCTAAACCCTTCCACAAGAAACAGAAAATCGTGGCACAGTCTGTCGTGTCCGGTACACAGCAGTGGTCGAGCTGGCACCACTCCGTGCCGCCGCAGACCTACGTCCAGGAAGCATCACAGTCACAGCCGATGG
CGGTCAACCCCGTCGTGCTACTCAAAACTGCTTTCCCTGGATTAGGCGACGTGTCAACACTACGTCTATCCACAATGAGCACGGAGCGTGTGTGCGGGTTGTGTCGCGTGGCGCTGGGTGGGGTGGTGGGAGGAGGGGGAGGGGGCAACACAGCCGCTGCAGCGCTCATGAAGCACAGGGTGTGCGGGCTGGTGCTGACCGTGTGCCGGCTGGATGACCTCAAACCGTTACTAGACTTACCGTTCGGTGATTGGGAGCTTTTCAAAATGCTCATATTGAATCTGCGAGATCTCGAAGCCAGTATGCCGACTAACACGCCGGCCGTCACCGTCATACAAGAGAAACCCGTTGATGCGGAAATAGACGTGAAACAACGACCGTCCCTCGAGCACCAAAGAAGTCGCCCCACCAACGTCGAGAAACAGGTAACACTGGAGGAACAGATGATATGTGGAGCGCTGCAGACCCTGAACGAGGAGGCCATGGAAGACCTGCTGCAGTCGGAGCCCACAGGTGAGGCTCCATCCTTGTCCCGCTCCCCCTCTCCCTCCCGCAGCCCCTCCCCCTCCCCCTCGGAGGCCGAGCCCCGCGCACCCCACGTGGTCGTCAATCTGTCCGACGATGTCTTCCTAGGGGTCGCGGCCGCCTCGACCGGAGCTCATGACGGTGTTGTTACAGCCGGGGCTCCAGCGACCGCGGACCGCACGCCCGCCGTCAGCTTCCGTGTGGAGAGTGACGACGACGGACACGTTTCGTTCACGTGTCGTCCGCGGTCCCGCCGGCCGCCGCGTGCCCGGCCCTCCTCGCTGCGCCTCTCCGACGAGCCCACGCCGCGCCTGGCTGCAAGATCCCTCTCCGTGGAGGACTCGCGCTCCGCCCCCACTTCCAATGTCGCCTCGCGGACGGCGGATTCCCTGCGTCACGTAAGCTCAGCTGAACGCCTCACTCGTCTCAAGGACGAGATCATGTCCCGCGACAGGAGCCCTCCTCTAGTCGATGGACCCGCGAGCGACGACGAGTCAGCGCCGCTGGTGTCCTCGCCCCCGTCCACGCCGGCGGCTCCCTCGACCGCATCCCCACCAGCCGGCTCACCATCACCGTCACGGACAGAACCCGTGGGCGCTCGGTCCTTGAATGTGGATTGCGTGGACCGCAGTTCGCAGGAGATGACGGCCAGTACGGACTTCTCGCCCCGGTCGGACCTCACCGAGGCGGAGTCGTTGCGGGGCAGTGCGGGTGACTTGGAGTTACTGCCGGGCGGGAGGAGTGTGAACGGCTCGTCTCGCCGCGGACTGACACGAAGCGGGAGTGACGCGTCTCTGTCACTGTCTGTGGAACCCTATAACATGCGGGTATTGTCGCGCGGCGTGACTGACGGCTCCCGGGGTCTATGGCGGCAGGACGCGTTAGATTCCATAGAAAGCGCGCCGCCGTGGCCTCTTGAACCCGACTCCGCGGTATGA

Protein sequence:

>DPOGS213810-PA
MSQSPELPKTTNGENEYVHIRCLIANIHKPRNVSRPPVSESPLEVVKESPTSSDADPSCSESDISEKSEISNANDTKNTEEAPPRRAKSNSFAGAMLAPPMFNKRRRPSFLHLAVGGGNEHSGPLSAGSHFNVPRFTVTAPPGEHRRFSHGFPFHGFALRRHSNTSLHRTESMVSLACFKTLSQLVNTDGNELQNFLSSNRNINVDDKDDNGTTALMVASESGRLSAVRLLLGAGSDACAADGDGWTSLAFAARGGHLAVVQELIDAGVVIDSRDCGGWTPLMWASYKGHEDIVVLLLEKGADVHAHGNYNINSLVWAAGRRHSGVVSRLLSAGARPNSCDKYATSALTWASRAGDTASCAALLRAGADPNTAGMYCWTPLLQATHGNHFEIVQMLLEHKPNVNALDKDGCTALAIACKEGYYDISLALINSGAYINVQDHSKDTPLIYAVKGGYKNIVEALLKKHVDVDLPGKEKKTPVYTAVEKGHVAILKLLLASNPDLEHCTTSGDTALLRAVRSRNAEMVALLLERRARVAAADNRGDTALHVAMRARSKQIVEILLRNPKNSQLLYKPNKMNETPYNIDMSYNKTILGQIFGARKLNTNEDNENMLGYELYSAALADMLSEPSLSVPITVGLYARWGSGKSFLLNKLKEEMKNFARQWSETGWSWSWAVWWGAWHASLAVAACSSMAGAPSHVALALLFVLFAAMYLGFYLLWYLGNRYEWWWAGGMVAALGRRFSSLLLLLQVVFCHPPGPNDPRALPATPIRFHFTEGMKSGPGQEGEAMVVQMLGSLAEALECQYGRVCTRLARAFRPRPLSSTSGWKWRKACCIPHIITFELSFICVLLGVCVLVLYLTDPEEDPSRRDVRQGVMVGACAAAGAVLLANLYAGARALAALALPPRARLARALKRDHAHTVALRPEVQALTHTVSCLDAFTGQQTRLVVVVDALDSCEQEKVLALLNAVHALCSDPRSPFILLLAIDPHIISKAVEINSRRAFSESNIGGWDYLRNMVQLPFYLQNSALRRVKVAQQTAARRMQALAVDDFSTSLQRSVSARRLSSTSELMSSQERIKGRGEVRGEGGRGRLRPSESVASSVASGLHRPAPAPAGAADLGRVLLTDDYFSDVNPRSMRRLMNVLYVTGRLLKAFQIEFNWYQLASWVNLTEQWPFRTSWIIYHHETYEEHIEDSTSLKHIYDKVKPSMGGLREASTLIELDRDERKLEVFLSFHRSTLTAADLKIFLPFTINLDPYIKKVIKEEHAQAGVEEDLGASGAASMYASNRTQQAKPFHKKQKIVAQSVVSGTQQWSSWHHSVPPQTYVQEASQSQPMAVNPVVLLKTAFPGLGDVSTLRLSTMSTERVCGLCRVALGGVVGGGGGGNTAAAALMKHRVCGLVLTVCRLDDLKPLLDLPFGDWELFKMLILNLRDLEASMPTNTPAVTVIQEKPVDAEIDVKQRPSLEHQRSRPTNVEKQVTLEEQMICGALQTLNEEAMEDLLQSEPTGEAPSLSRSPSPSRSPSPSPSEAEPRAPHVVVNLSDDVFLGVAAASTGAHDGVVTAGAPATADRTPAVSFRVESDDDGHVSFTCRPRSRRPPRARPSSLRLSDEPTPRLAARSLSVEDSRSAPTSNVASRTADSLRHVSSAERLTRLKDEIMSRDRSPPLVDGPASDDESAPLVSSPPSTPAAPSTASPPAGSPSPSRTEPVGARSLNVDCVDRSSQEMTASTDFSPRSDLTEAESLRGSAGDLELLPGGRSVNGSSRRGLTRSGSDASLSLSVEPYNMRVLSRGVTDGSRGLWRQDALDSIESAPPWPLEPDSAV-
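The relationship between the two sequences above can be illustrated by translating the first and last codons of the transcript. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) and reading frame 1; the record itself does not state which table was used, and the `translate` helper is hypothetical. The trailing `-` in the protein sequence appears to mark the stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; stop codons become '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 60 bases and last 9 bases of DPOGS213810-TA, copied from above.
first_60 = "ATGTCCCAATCACCCGAACTGCCTAAAACAACTAACGGCGAAAATGAATACGTTCACATA"
last_9 = "GCGGTATGA"

print(translate(first_60))  # MSQSPELPKTTNGENEYVHI
print(translate(last_9))    # AV*
```

The translated start matches the first 20 residues of DPOGS213810-PA, and the final codon TGA is a stop, which lines up with the `AV-` at the end of the protein sequence.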