Monarch geneset OGS2.0

DPOGS204963
Transcript: DPOGS204963-TA | 4617 bp
Protein: DPOGS204963-PA | 1538 aa
Genomic position: DPSCF300160 + 610938-630364
RNAseq coverage: 220x (Rank: top 45%)
Annotation
Heliconius: HMEL003742 | 3e-117 | 74.44%
Bombyx: BGIBMGA011132-TA | 2e-168 | 48.25%
Drosophila: CG10011-PA | 1e-119 | 34.10%
EBI UniRef50: UniRef50_E2AFW2 | 1e-133 | 38.16% | Ankyrin repeat domain-containing protein 50 n=3 Tax=Formicidae RepID=E2AFW2_CAMFO
NCBI RefSeq: XP_001607344.1 | 5e-138 | 40.65% | PREDICTED: similar to ankyrin repeat domain 50 [Nasonia vitripennis]
NCBI nr blastp: gi|270008619 | 4e-137 | 38.02% | hypothetical protein TcasGA2_TC015164 [Tribolium castaneum]
NCBI nr blastx: gi|270008619 | 8e-136 | 37.71% | hypothetical protein TcasGA2_TC015164 [Tribolium castaneum]
Group
Gene Ontology: GO:0005515 | 5.8e-07 | protein binding
KEGG pathway:
InterPro domain: [995-1301] IPR020683 | 4.8e-74 | Ankyrin repeat-containing domain
InterPro domain: [1071-1100] IPR002110 | 5.8e-07 | Ankyrin repeat
Orthology group: MCL14010 | Single-copy universal gene

Nucleotide sequence:

>DPOGS204963-TA
ATGTCGAACATGCCGATATACAGTCAGGTGGCGCCCAAGTATAGGGCAAAAAACGCTAAAGGCAACTCCACGCGGGAGAATAGGGGTAATCCGAGAGAATTCGACCCCCTGAATTACAACCACGGGGGCAATATACAGCTTCTGAACGCGCCGATGGAGAGTAGAATTGAGAAACTAGAAAAGAGGGAATCGAGACGCAAAGAAGAGACCAGAGTAAGAAGAACACAGAACAGAAGTGACGCCAACGAGGGGAAGGTCGAAGCGTATTTGAGGCAGAACCAGGTCAGCATAAGGCAGCAGCAGAACAGACATTCCGCTGTCAGACACGCCGAACACCGGGTTAATTTACGGTCACTCCCCCTGAACGAGATAACATCAAGGCAGGAGCGTACTCCGACCGGCCACCGGGTATTCCCCGAGAGAGAGGTCTCCAGGGACAGTAGACGACAGAGAGACAGGGAAAGGGACTTCGATTTTGATGGATCCCCGCTGAGTCCAATAGCTCGTGACAAGGACAGAGTGAAGAAGTCTTCAACACGGGTGTCGGATGTTAGTGACAAGAGGAGGTTCTTCTGTAGGGAGTGGGTGTTCCAGAAGATAGCGCACTGCTTAGAACAACGAGCGGTCAGCAAAACTGCTGGAACCCTTATATTGGGTGACGCTGGCAGTGGGAAGACTGCCTTATGTCAAGAGCTGAGCGCCCCGGGCACAGGTCCTCAGGCTAGACAACAGCGAGCGTTGAACAGGCGAATGCTCGCCAGACACTTCTTGCAGGGTCCAGGAGACTGCAGCCAAAGACCTGGAGAATTCGTCCGTTCACTGGCGATGCAGATCCTTTCACATTCTGAACATGCAAAACCTGATGACAGATCAGATAGAAATTCCTTGGAAGAGAATTTTGTTCAGCGGTTCCGTGATATGGGTGAGGATTGTGAGGAGAGCTCCAAACCACTGCTGGCTGATAGTGAAGATCAACGCACAGACGAGGAGGAGACGAGCTCTAGACAGAGAACAGCGGACAGATTACACGAGAGAGAAGAGTATACAGATTGTATTAATGACGCCGAGCTCAGTGGACATATGCTACCGGAAATACTGCCGAAGGCGGATGTGAAGCCAACGAATCCGTTCGTCAGCGACCAGAATGACCTCCGTCTATACGAGAACCACGAAAACTTGTTCCTCCGCAACATATCACAGCGACAGTCCAAAGAACTGCGGAATTCAAGACTTCTACGTCAAAGTTCTGAACCGCTTCAGGAGAAGAGACCTTCCGTGCTCCAGAAGAGTCTGTCGAACGATCAAGAAGAGAAGAAAGAGATGAACAGCCCGCCAAAATCAAGAATACCAGTCGCCAACTTCCGGTATCCGAATAAAAGTGGTCTACGTCCTGATAGTTCCCCGAAAAAGGATCCCAAAGAACTTCCCGAGTACCAGAACATAGTCCACGACGTTAAAGACGATCCGCAGACTGAGATGGAGCTACTTCTGGAGAAGAAGCGATCAGCGTCTGAAGAGGAACCGCCTCCGATCCCCAGCCTGCCCGTCAATCCCAGGACCTTGATAGCCAACGCCTACTACGAGAAGCTGCTATCGGAAACGGAAATCCAGCAGGCCTTGCTGCCCCAGAACCTGGACAAGAATCCCGATGAGTGCTTCAAGAAGGCCATACTGTTTCCGTTACTGGAGATAGATCCACCTAAGCAGTGCTTGTTTTTACTTATTGACGCTATTGATGAGGGCGCCACTAACGACGGCGATGGCAGCGAGGGCAGCGTGGCCGGGGTGGTCGGCCGTCACCAGCACCTGCTACCGCACTGGCTGCTGCTGGTGGCCACTGCGAGGAGACACTCCCGCCTGGCTAGGGTGTTCACCGGTTTCCGTAAGATAACCTTGGATGAGCTGTGTCGAGCCCATGTGGCCGCGGACGTCCAGCGCTACGTGTTGGCTCGCCTGGACAACGAACCTAGACTAAGGGCGCGCGTGTCCAGCGACGCGGC
GGCGGCGGCCTCGGCGGCGGCCGCCCTCGATCATCTCCGCATCAAGAGCGATGGATGCTTACTGTACCTCGAGAAGGTGCTGGACGGTGTAGCCGACGGTTTTATAGCTCTGCGTGAAATAAGAGAGATCCCAGGTACACTCAACGGTCTGTACTTGTGGCTCGCGCAGAGGCTGTTCCACGGACGCAGATTCAATAAGGTCCGGCTGGTGTTGGACGTGCTGCTGGCCGCTCGCTGCGGTGTGACTGAGGACATGCTGTACAAGTGTCTCCTCACTAAGGAGTACAGCGTCACCAGGGAGGACTTTAACCGACGGATGCATCTGTTGAGGAGGATAGTGTCCGTGGACCGCTCGACGGGCTTCGTGGCGATCTTCCACCGCTCCTTCTCCTCGTGGCTGGTGGACGTGAAGCACTGCACCCGGCGCTACTTGTGCGACGTCTCCGCGGGTCACGCCGCCCTCGCCATGCATTACACTCTAGAAGCCAGAAGACTGTCAGCTCTCGAGATCCATCACTACGTGTATCACATGACGCAGCTGGAGCAACACCTGGCCTCGCTCAAGAAGGGGAAGCTCGGCTGTGAGCCCGTGGAGCTTCATACTCTGGTGCTGCTCTGGGTGTTGGACTCCGGCTGCCAGGTGGAGGCGGCGCTCCAACATGACAGAGGACAGATCGAGGAGAAAATCGAAGATAAAGATCAAGATCCGGAGTCTGAGGGAAAGGAGTCGACTTCCTGTAAATCATTGGAACAGTCCGCTCTGGAGAACATAATGCCGGAGCTGGTGAACGGCAGCACTCCCAGGTGGCCGAGGGACAGGAGGGTGATGCGGGCCCTCATGGAGCTCAGCAGGACGGATTCGGTCCCCACGGAACCCGAGGAAGACGTCAATGATCTGCTGTCCACTGAGAAGGCGCTGGAGAGTGAAGAGAACGCGACCGGGGACGAGCACGATGAGGCGCTACTCCTGGATCCGGGGACTGTTCATGAGTTAGCAGCGAGAGGAGATGAAGACGCGTTATCAGTTTTATTGAAGCGTCGTCCTGAGCTGGCTCAGTCGGTGGACGCGGCGGGGGCCACGGCCTTGCACGCCGCGAGAGCTGCGGCCTGGGCTGGACACGTGGAGGTGTTTATTTTTACTTGTTTAAGTAACGTGGAGGTTGTCCGCCAGCTGCTAGACCGGGGGCTAGACGAACACCACCGGGACAACTCCGGCTGGACGCCGCTACACTACGCCGCCTTCGAAGGTCATATAGAGGTCTGCGAAGCGCTTCTGGAAGCGGGGGCGAAGGTCGACGAGGCCGACAACGACGGCAAGGGACCTCTCATGCTGGCGGCGCAGGAGGGACACACCAGGCTCCTGGAACTGCTCGTAGACACCTGGGCCGCCCCGGTCGACCAACGCGCGCACGACGGCAAGACGGCGCTGCGCCTGGCGGCGCTGGAGGGGCACTTCGAGGCAGTAGCCGCGCTGCACTGCCGCGGGGCGGACGTGGACGCGCTGGACGCGGACCGGCGGAGCACGCTATATGTACTGGCCTTGGACAACAGACTGGCGATGGCCAGGCAGCTGCTGGCGTGCGGGGCCAGCGTACACTCCAGTGACACTGAGGGTCGGACTCCTCTCCACGTGTCCGCCTGGCAGGGGCACACTGAGATGGTCAATCTGTTGATAAAAGTCGGCGGGGCGTCCGTGGACGGCCGGGATCGCTGCTCACGCACGGCGTTGCACGCGGCGGCTTGGCGCGGCCGGGCCGGGGTGTTGCGGACCCTGCTGGAACACGGAGCGGACCCCGCGGCCGTGTGCACCCAGGGAGCTACGCCGTTGGGTCGTACGCCTGCTAAAGTAGCCTGGAGAGCTGGACATGCGAACATCTGCCGGCTTCTGGAGCGCTGGACCGCGCCCTCCGCACCTCCAGCACCTCCCGTCACACATCACGAGGACAAGCGACCAGCCTCCCCGGAGTACAAACGCCGTAGTATCCACAGCTCCAACTCCA
CAAAATCATCGTCCAACATGACCGGCGGCTCCAACAGGTCACACGACGAGGACGATAAGGGTTCCCTCTCTTTCGCCCAGCAGGTGGCGCGCTGTGGACGAGCGAGACGGGAGATAGAGAGAGACGAACCGATACCAGAGCACCAAGTGCTGGAACAGGACTCCAAGCTCAGGAGTTATATAGCGAATGAGAGGGACAGCGAGCTACATGGATATGCGAGGGAGAGAGACAGGAGACGGGAACAGAGACACGGCACCACCAGCCCGCTGTACGCCTCGCCGCCCAGGAGCCCCAGCGAACCACGGAGCCCCGACCCGCCTGCTGGTTCCCAGCCAGCCAGTCTAACGAGCGCCCCGGCACTGACGGACAACCACTTCAACAGAGACACGCACATGAGGATCATCCTGGGCAGAGACAAGCACGCGGAGAAACATGACGGTAAAAATAAGAGGAATGGCATCGTCACCAACCCGGCGATGCGTCTGGTCGCTAACGTTAGGAACGGTCTGGCAGCTAACATTCGCCGGACGGGGGTCGCGTTAGCAGCCAGCGCCAGTTCCTCCAACCCAGCAGTCAAGACCAACGCGTTCCAGTGGAGGAAGGAGACTCCGCTCTAG

Protein sequence:

>DPOGS204963-PA
MSNMPIYSQVAPKYRAKNAKGNSTRENRGNPREFDPLNYNHGGNIQLLNAPMESRIEKLEKRESRRKEETRVRRTQNRSDANEGKVEAYLRQNQVSIRQQQNRHSAVRHAEHRVNLRSLPLNEITSRQERTPTGHRVFPEREVSRDSRRQRDRERDFDFDGSPLSPIARDKDRVKKSSTRVSDVSDKRRFFCREWVFQKIAHCLEQRAVSKTAGTLILGDAGSGKTALCQELSAPGTGPQARQQRALNRRMLARHFLQGPGDCSQRPGEFVRSLAMQILSHSEHAKPDDRSDRNSLEENFVQRFRDMGEDCEESSKPLLADSEDQRTDEEETSSRQRTADRLHEREEYTDCINDAELSGHMLPEILPKADVKPTNPFVSDQNDLRLYENHENLFLRNISQRQSKELRNSRLLRQSSEPLQEKRPSVLQKSLSNDQEEKKEMNSPPKSRIPVANFRYPNKSGLRPDSSPKKDPKELPEYQNIVHDVKDDPQTEMELLLEKKRSASEEEPPPIPSLPVNPRTLIANAYYEKLLSETEIQQALLPQNLDKNPDECFKKAILFPLLEIDPPKQCLFLLIDAIDEGATNDGDGSEGSVAGVVGRHQHLLPHWLLLVATARRHSRLARVFTGFRKITLDELCRAHVAADVQRYVLARLDNEPRLRARVSSDAAAAASAAAALDHLRIKSDGCLLYLEKVLDGVADGFIALREIREIPGTLNGLYLWLAQRLFHGRRFNKVRLVLDVLLAARCGVTEDMLYKCLLTKEYSVTREDFNRRMHLLRRIVSVDRSTGFVAIFHRSFSSWLVDVKHCTRRYLCDVSAGHAALAMHYTLEARRLSALEIHHYVYHMTQLEQHLASLKKGKLGCEPVELHTLVLLWVLDSGCQVEAALQHDRGQIEEKIEDKDQDPESEGKESTSCKSLEQSALENIMPELVNGSTPRWPRDRRVMRALMELSRTDSVPTEPEEDVNDLLSTEKALESEENATGDEHDEALLLDPGTVHELAARGDEDALSVLLKRRPELAQSVDAAGATALHAARAAAWAGHVEVFIFTCLSNVEVVRQLLDRGLDEHHRDNSGWTPLHYAAFEGHIEVCEALLEAGAKVDEADNDGKGPLMLAAQEGHTRLLELLVDTWAAPVDQRAHDGKTALRLAALEGHFEAVAALHCRGADVDALDADRRSTLYVLALDNRLAMARQLLACGASVHSSDTEGRTPLHVSAWQGHTEMVNLLIKVGGASVDGRDRCSRTALHAAAWRGRAGVLRTLLEHGADPAAVCTQGATPLGRTPAKVAWRAGHANICRLLERWTAPSAPPAPPVTHHEDKRPASPEYKRRSIHSSNSTKSSSNMTGGSNRSHDEDDKGSLSFAQQVARCGRARREIERDEPIPEHQVLEQDSKLRSYIANERDSELHGYARERDRRREQRHGTTSPLYASPPRSPSEPRSPDPPAGSQPASLTSAPALTDNHFNRDTHMRIILGRDKHAEKHDGKNKRNGIVTNPAMRLVANVRNGLAANIRRTGVALAASASSSNPAVKTNAFQWRKETPL-
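
The lengths and coordinates reported in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that the transcript is coding sequence only, ending in a single stop codon (the nucleotide sequence above does start with ATG and end with TAG), and that the genomic and domain coordinates are 1-based and inclusive. The function names `expected_protein_length` and `inclusive_span` are hypothetical helpers, not part of any database tooling.

```python
def expected_protein_length(transcript_bp: int) -> int:
    """Residues encoded by a CDS-only transcript whose last codon is a stop.

    Assumption: the transcript has no UTRs and ends in exactly one stop codon.
    """
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3 - 1  # subtract the terminal stop codon


def inclusive_span(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 4617
protein_aa = 1538

# Transcript length versus protein length.
print(expected_protein_length(transcript_bp) == protein_aa)

# Genomic span of the gene on scaffold DPSCF300160.
print(inclusive_span(610938, 630364))

# Lengths of the two InterPro hits (in amino acids).
print(inclusive_span(995, 1301), inclusive_span(1071, 1100))
```

Under these assumptions, 4617 bp corresponds to 1539 codons, i.e. 1538 residues plus the stop, which matches the reported protein length.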