Monarch geneset OGS2.0

DPOGS205978
TranscriptDPOGS205978-TA3555 bp
ProteinDPOGS205978-PA1184 aa
Genomic positionDPSCF300164 - 117958-124004
RNAseq coverage423x (Rank: top 29%)
Annotation
HeliconiusHMEL0148670.074.46% 
BombyxBGIBMGA009407-TA0.071.53% 
Drosophilainsc-PA5e-1926.17% 
EBI UniRef50UniRef50_UPI0001792C763e-6732.60%UPI0001792C76 related cluster n=1 Tax=unknown RepID=UPI0001792C76
NCBI RefSeqXP_001952839.16e-6832.60%PREDICTED: similar to inscuteable [Acyrthosiphon pisum]
NCBI nr blastpgi|1936669121e-6632.60%PREDICTED: protein inscuteable homolog [Acyrthosiphon pisum]
NCBI nr blastxgi|1936669126e-6631.75%PREDICTED: protein inscuteable homolog [Acyrthosiphon pisum]
Group
Gene OntologyGO:00054881.1e-13binding
KEGG pathway 
InterPro domain[856-1161] IPR0160241.1e-13Armadillo-type fold
[793-1047] IPR0119897.4e-06Armadillo-like helical
Orthology groupMCL20503 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205978-TA
ATGTCGAATTTTCAACGTACAAGAAGCAAGGTCTGGTGGGGTGACTCACCACCAAATATCGGCAGCCCAGTGGCACAACCCGAGTGGAACCAAAACAATCAGAGAGTTGAACCAGAGTGGGAGCCCCAGCTTCCTCATTGGATCAATCAACCCGAACAAAGAAACAATAGCGCGGACAACTTGGCGCAAGCCAATAATTTGCCGAATCAATGGCCTCTTCGTCACGGCAGTCCAGGAAGTCATAAAAGCCAGGATTCTGGTTTTTCAGATTCTGATAGCTCTCCTCCAACGTCCCAATACTATTCGCCATCAGATGAAAATGACAAAAATAAATCAAATAACAGCAGCGATTCAAATAATAATGAAAACGTTACTATAAAACAAGCGGCTGAGTATCAAAACTCACATCACAACAGTAAAGATGAAGTTGACTATTCCAATGTCAATGTTAATACGCCGACACCAAAACCTCGTCTCCGCAGCAGCAGCGTCATACAAACTCCATATGACCTTCAAGAGTATTTCGAAGTACACCAAGATCAAGAACATATGGATTTATTAAAGGATAAGAAATTCGATGAAATTCAGAAAAATTTAAATGACGATAATCAACGTAAAGATAAACTCAACAAATCTTTTGACACAGCTTTAAATTCTAATTTGAAAAGAAACGCACATAATAACTCTATCAAAAGTATGACTCAGAATAATAAAGAAAATACTCCAAATTCAAGTGCAATAGAAAACGATGCCAAAGATAAAAATTTAAATATTTTAAGTCCGTCTAAAAAGTATCCAGCCAAAAAGATTACAACTGGCGACAGCGATTGTAACCAAACGATTGTAAAAATGGCTACTGTTAAAGACTTGGATACAAATAAAAACAAATGTGATGATAGTTTAGAATATATACGATTATCACACAATAACGAAATTGCAGTAAACAATCAGCCGAGGGTGCGTACACCAATAACAAACGTGTCATACGTTCCCACGGAAAAAATATCGTCTTCCAAGCCGGAATACAGACGCAGTATGTCAAATGAAGAAAATATTGCTATCACGACTACTCCTAAGTTCCAGAGGAACAGACTTAACCAGTCGCTTAATTTAGAGGGCCAGAAATTAGACCGACAAATATTACAAATACAAGAAGGCTATCACTCCTTAGGATATATAGATGAAGAAGAATTACCAAATCAACCAGCTGTTTATCCAAAGGAACAGGTTCATAAGCCAACTAAAGCCTTGCTAGGAAAGAATATTATGGATAGTCTACATTTAAAACCTTCGAAAAAGGTCGGTCCTAAAGCTAAACATCGATCAGCCGGGAAGGATAAAGCAAAGGAAGCCTTTAGGCTGTTTTCTAACAGAAAAGAAAAGCCAGGTGTCGTTGATAAACTTGCAGGTGGAGTGCTAGACGGTAGCAGGGTACCTTCCTTAGGTCTGCCAAGAACATCAGACCAAAACTCTTGGGTAGTGGTTGGACATCCGCAGAAAGAAATGCTCATTCCACACTACAATGAGAGACTTGTTCACACAGATAGTGATATTAATCTAAAAAGTAAGGATGTCCAAAATGGTAAAGTGGATAAGAACAGATTAACACCTCCGCCACAGTTCCAAGATAAACAAAAAATACCGTACGAAAGCAATTCAAAAACCGACAAAGTATTAACGCACGATTCAAAACGTGAGAAATTAAAAGATGACTATACAAGCTTTATGGATGTACAGTTAGACCTAGGTTGCACATCCACACCAAAAATGCTAGCTCCTCACGAATTCATGACTGACGTACTTAGTAGCAACAAAAAGCCACGCAGAGTAAATTTATTACAAAACTTTAACAGTGGGTTAGTGTTAAACAACCGAGATGAGATACTGTACAGAGAAAGGAGTATAGATCAGACGAATTTGGAACGTCTGTGTGATAGGAAACCAGAGCCTGTGCAATATTGGCTGTGTGATTTAGTGGGGTCTTGTGAGAACGAATGCATGACAACCCTGCAGAGCAAGCCTTTGGGAGCGGAAATGAAAGCTATGGTTTCATCCAACTCGGCAACTATAACCGGAAATATAAAGAAGGTGCAAACACACGGACAAATTATCGTTCATCAATACAGCGATGTTTTACGACAAATGGAGACGAATAACACCAGTGAAGAACAAATATGCTTATTAGTGGCGAATATTAACGAATTTGTGCTAGCACACAGCATCACCCTGAACACAAAACCTCCCGGAGAGACCCCAGAGAGATGGGACAAAATTAAAATGTCCCTACAAATGTACCTCCAGAAGTTAATAGACATCGGAGAAGATGTTAAAGTTGAACTGAATGAGAGCAGACCCGAGTGGAATTTGGTACAAAAACATTTGGATACACTAATTGATATATTTTTGGAAACCACAAAAGCCGTGCTCGTCCAACAAATGAAGACGCTTGTGTCAATTATCGAAGAACCACCATCAGACATGATACTAAAAAGCACTCTCACATCCATCGGCCATTTGGCGATGTTGACGGAAAACACGGAAAGTTCATTACACGTAAAATGTACGAAAATCATGAAAAGCATCACGGAACTGCCGTTCGTCGACTGCGGTGTACCGAGGGCCTTGTTGACAGCAATTATCGAAAGCAAGAAAAGTTCTATAAAAGCGCTCTCTTTACGCGCGCTTGCCACTGTCTGTGTGACAGGTGAAGCTGTCAAACAATTTGAAGAGGGAGGCGGTATAGAAATCTTATCGGACATACTATCAGACAGTTCCAATCCTGAACCTCAAATGCGTGAAGCTATCACCGTTCTCACTCAGATAACAGCGCCCTGGCTGAGGGGACACTATGATATAACACAGCTACACGTATACCTCGACAACATAATAAAGCATATCACTGATATTCTCTGCCGGACTGAATGCTGTCAGCTTCTCCTTCTTTGCACGGCGTGCCTCGTCAATCTCGTGAGGGAATTGGAACTTGAGAAAAAAGAAGACGCAGACTCCGATATAATAGAAGTGATAAATAAATATCACACGGTTGAGAAATTATTGGACGCAGTCAAACGACGGGGACCAAACATATCTGTATTCCTACAGGAACAGACGGCATCTCTCTTGGCGTCTCTGTCTCGTGTGTCACGGTGTATTCTACCCTTGTCACGATCCCCGGTCGCCAGTGTGGCGCTAGTGTGTTTCGCGCGCAGCGGACCGCTAGCGACCCGCGCCGCAGAATTACGCGCAACTCATCGTTTACATTGTCATACTGCTGAAGCTATATCTAGGTTAGCAGTAATTCCAGACGTGGCACATCAGATAGTACAACTTGAGGGCATTCCTCATCTTATCCGATTGGTTAGGATGCAGAGAACTCGACCTCATGTGGATGAAAACCCAGATCAGACACTCATCCACACCCTGAAAGCACTTCGAACTCTACACAAACATTACCCTGAAGCTTTTAATGATGAAAAGGATGTCAACGAAATGATTAAACCGCACTTGATGGAATCTCTCATGTTATTTTCTGCTAAACAAGAAAGCTATGTTTAA

Protein sequence:

>DPOGS205978-PA
MSNFQRTRSKVWWGDSPPNIGSPVAQPEWNQNNQRVEPEWEPQLPHWINQPEQRNNSADNLAQANNLPNQWPLRHGSPGSHKSQDSGFSDSDSSPPTSQYYSPSDENDKNKSNNSSDSNNNENVTIKQAAEYQNSHHNSKDEVDYSNVNVNTPTPKPRLRSSSVIQTPYDLQEYFEVHQDQEHMDLLKDKKFDEIQKNLNDDNQRKDKLNKSFDTALNSNLKRNAHNNSIKSMTQNNKENTPNSSAIENDAKDKNLNILSPSKKYPAKKITTGDSDCNQTIVKMATVKDLDTNKNKCDDSLEYIRLSHNNEIAVNNQPRVRTPITNVSYVPTEKISSSKPEYRRSMSNEENIAITTTPKFQRNRLNQSLNLEGQKLDRQILQIQEGYHSLGYIDEEELPNQPAVYPKEQVHKPTKALLGKNIMDSLHLKPSKKVGPKAKHRSAGKDKAKEAFRLFSNRKEKPGVVDKLAGGVLDGSRVPSLGLPRTSDQNSWVVVGHPQKEMLIPHYNERLVHTDSDINLKSKDVQNGKVDKNRLTPPPQFQDKQKIPYESNSKTDKVLTHDSKREKLKDDYTSFMDVQLDLGCTSTPKMLAPHEFMTDVLSSNKKPRRVNLLQNFNSGLVLNNRDEILYRERSIDQTNLERLCDRKPEPVQYWLCDLVGSCENECMTTLQSKPLGAEMKAMVSSNSATITGNIKKVQTHGQIIVHQYSDVLRQMETNNTSEEQICLLVANINEFVLAHSITLNTKPPGETPERWDKIKMSLQMYLQKLIDIGEDVKVELNESRPEWNLVQKHLDTLIDIFLETTKAVLVQQMKTLVSIIEEPPSDMILKSTLTSIGHLAMLTENTESSLHVKCTKIMKSITELPFVDCGVPRALLTAIIESKKSSIKALSLRALATVCVTGEAVKQFEEGGGIEILSDILSDSSNPEPQMREAITVLTQITAPWLRGHYDITQLHVYLDNIIKHITDILCRTECCQLLLLCTACLVNLVRELELEKKEDADSDIIEVINKYHTVEKLLDAVKRRGPNISVFLQEQTASLLASLSRVSRCILPLSRSPVASVALVCFARSGPLATRAAELRATHRLHCHTAEAISRLAVIPDVAHQIVQLEGIPHLIRLVRMQRTRPHVDENPDQTLIHTLKALRTLHKHYPEAFNDEKDVNEMIKPHLMESLMLFSAKQESYV-