Monarch geneset OGS2.0

DPOGS212766
TranscriptDPOGS212766-TA2934 bp
ProteinDPOGS212766-PA977 aa
Genomic positionDPSCF300012 + 856987-868120
RNAseq coverage62x (Rank: top 68%)
Annotation
HeliconiusHMEL0155279e-13647.35% 
BombyxBGIBMGA013138-TA8e-8242.83% 
DrosophilaCG32668-PA8e-3428.57% 
EBI UniRef50UniRef50_D7EJ198e-7528.86%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7EJ19_TRICA
NCBI RefSeqXP_972605.21e-6928.50%PREDICTED: similar to Armadillo repeat-containing protein 2, partial [Tribolium castaneum]
NCBI nr blastpgi|2700160523e-7428.86%hypothetical protein TcasGA2_TC012901 [Tribolium castaneum]
NCBI nr blastxgi|2700160521e-6728.71%hypothetical protein TcasGA2_TC012901 [Tribolium castaneum]
Group
Gene OntologyGO:00054884.1e-19binding
KEGG pathway 
InterPro domain[454-758] IPR0160244.1e-19Armadillo-type fold
[330-761] IPR0119898.3e-17Armadillo-like helical
Orthology groupMCL16888 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212766-TA
ATGGGTGAGCGTATCAAACGCGTCGGTGCACCGTTCTACGCCCCTCCAAGGAAGACGTCCGCTGAAATCATCAGCGAAGCCCGAGCTGCTATCACTGCAGGCAAGTCGTGTTTGAAACAGAAGCATCATTTTGCAGACATGAGTGGTGCGAGCGGGTTAGGCCTTCGCCCTTTACGCACACGCCGGCCGTTTACACCTCGGGAGCCCCAGAGGACTCTACTGTCGGACACGAGACGAGTTGACATACGTCCCACCAGCGGCTTTGATCTGAAGTATCAAACGGTTCAAGAGAGTAGCGAGGATGCGTTTATGAATTACCAGGTATCGGAACAAGAGCACGTCTCAAACGGTGTTCCGTTCGGAGAAAATCAGCAGCAAAGAAAGAAAACGTTGAAGTCTACGAAATCTATTAAAGGCGCCGACGCTTGGAGCGGTTTTCCTAAGCTACCTCACCTCAGCGGAAAAAGCAAGCCGTTACATAGAAGGAATACAATAGGACAAAACGATGCATCCGATTCAAGTAAGGAGCTGGCTCAATCTATATCAGTTACACGTCCTTTATCTGTTAGTAACGGTCCACTCTCGTATCTCAGATCGTTTCACGAAAAAAGTCAATTTTGCAGTACAGATAGCTCTAGCAGAAGTAAAACTTTGGGTGAGAAGTCGGTATCTTACGACGAGGGTACCCTGGGCGAGGTGTCCGTGAGACACCTCGATGTACAGCTTCCTGTCACCAGCGAGGATTGTCACAACATGACAGCCTTAGAGATATCAGAGGCGCTGACCCAGAAGAATCAGAGCGTTGATCGCGTGCTGTTTCTACTGGACGCTCTCCAAAAGACCGTCGAGGAGACCAGCCCCGGGGACAGTCTCCGCGAGCTGGTGCTCCGAGCACTGCTCTCTCGAACTCGCGATGATAGTGAGAGGGTCCTCGTCAAGGTCGCCAGAGTCATGCTGACGATGCGTGTCACTGGAGCGTATTTGACTGCTGCCAGCAAACTTGTCTTCAAAATTGCCAGCAACGATAAGAATGATAGTGTCTTCAAGAATGGGAACCTTCTCGAACTGATCGTGGAGTCGTGTGCACGCGCGTGTCCCCTGTCCGAGAGCTCGAGTGTGTTGCACGCAATGGGTGCTCTGCGAGTGTTGGCCCTGGAGCCTTCCCTGGCGGCCCGCAGCCGGACGGCCGGGGCTCTACACCTCGCCGTCCTGCACCTCAAAATCATTAATAACGCAAAAGCTGAACGTCCGAGGCAAGTGACGGAGGAGACGACGCACGCGCTGTACCAGTTGACTGGGGCTCTGAGGAACCTCGCTGGCAGCGGGTGGCAGGAGAGGGAGGGGGAGGAGGAGCGGAGCGGGGAGGGGGACGCGGGGAGGGAGTTCGCCAGCAGCGGAGCCATAGGAGAGCTCATCAACGCTTTGACATTACATACAGATCGAGATGTACTCACTAACGTTGCGAGGTGTCTAAGCGTGTTGTCATCATGGCAGTCTTGTTTGAACGCCTTGTGTTCCTGTCCGGGAGCGTCTCGTGCCTTGCTAACAGCGCTGGGAGCGTGCGCCGCGAGGGCAGCGCTAGCTGTGAGACTCGCCTACACGTTGGGCAACATGGCCGCCCACTGCGATCAGGCTAGGATAGACATATACAGCGAGAAGGGCAGCATCGATGTACTGTTGACTATACTGGAGTCGTACACGCAACGTAACGACAACGACACGAGAGACCACGACAACGATCCGGACTTACATCTAATAGGTTCCGATCTCGGCGGATCGGACGGCTCAAACGAGGACGTCCTCATAAAGACAGTTCGAGTGGTCGCGAATCTCTGTCTGACGGAGGAGACAGGTCGCGGGCTGGTGGAACACGCCGATAGAACTGTGAGGGCGATGCTGAGCTGTCTGGAGGTGGCAGCCAGGGTTGGGGGGAAGGAACTAACAGATGCAGAGCGGAGGTCCAGCGAGGAACGTCGCGAGGAACTGGCGACGGCCGCACTGGCCACCATCAACAACATCACCTTCTACTTTGAGCCGACAGACTCCACACACTTTGATACCTTGGACCACTTGGTTAAAGTAACATGTGGCTGGTTGTCTCACGGTGGGCTTCCAACACACGAGGCGGTTCGAGCGCTCGGTAACCTCACCCGGTGTGATCGCGCCGCTCGTGCCGCCGTCCTGTACGGGGCCTTAGACGCTTTACCGCCTTTACACGCTCACGACGACGAGGAGGTTCGCAGCGCGTCCGCTGGTGTTTTGGTGAACGTGTGCGGTGTGAGCGTGGGCGGCGGAGTGCAGGAAGCCGGGGGCGCGGCCGCTAGGGCTCTGGCCGCGGCCGCGAGGATGAAGGACGTCCGCTCGGGGGCGCTGCTGGCGCGAGCCGTGTGGAACGCTCTCGAACAACGGCCGTTGGACCTCCACAACGCTAGGATGGCGGCCGCGGCGCTAGCGACCTTCATAGGCAATACTATCCACATGTTTGGTCTCGTTGGCCATGTGACTCCGTGGCGCAACGGTAGCGCGTCTGACTCCAGATCAGAAGGTTGCGTGTTCAAATCACGTCGGGAAGACGAGTCATTGTTCGCTATGTGTGAAGCCGCAAAGTGCGAGGAGCGACGCGCAAGCGACCCAGATATAATGAAAAACCACAGTGTAAAATTGGGTTTGGAAGGTCGAGGCTACCACCGGGAGCATGTTGAATCGAAATTTTCGTTGAGCGTGGAAGAAGACTTGCACCTGGAAGAGGAGTTTGAAGAAGAAGGGGAAAGATGGTCAGGCTCGGACCTGGGTTTCGAGGAGGGCAGTCCGGAGCCCTGCTCGTGTGGACGCTGCGCCCGCGGCAGCTCATGGAGGGCCCTCGCCGACGTGGCGCTGCCGTTGCTGCAAAGACTGCTGCCAGCACGACGAGATGCAGCAGTCGGCACAGATTAA

Protein sequence:

>DPOGS212766-PA
MGERIKRVGAPFYAPPRKTSAEIISEARAAITAGKSCLKQKHHFADMSGASGLGLRPLRTRRPFTPREPQRTLLSDTRRVDIRPTSGFDLKYQTVQESSEDAFMNYQVSEQEHVSNGVPFGENQQQRKKTLKSTKSIKGADAWSGFPKLPHLSGKSKPLHRRNTIGQNDASDSSKELAQSISVTRPLSVSNGPLSYLRSFHEKSQFCSTDSSSRSKTLGEKSVSYDEGTLGEVSVRHLDVQLPVTSEDCHNMTALEISEALTQKNQSVDRVLFLLDALQKTVEETSPGDSLRELVLRALLSRTRDDSERVLVKVARVMLTMRVTGAYLTAASKLVFKIASNDKNDSVFKNGNLLELIVESCARACPLSESSSVLHAMGALRVLALEPSLAARSRTAGALHLAVLHLKIINNAKAERPRQVTEETTHALYQLTGALRNLAGSGWQEREGEEERSGEGDAGREFASSGAIGELINALTLHTDRDVLTNVARCLSVLSSWQSCLNALCSCPGASRALLTALGACAARAALAVRLAYTLGNMAAHCDQARIDIYSEKGSIDVLLTILESYTQRNDNDTRDHDNDPDLHLIGSDLGGSDGSNEDVLIKTVRVVANLCLTEETGRGLVEHADRTVRAMLSCLEVAARVGGKELTDAERRSSEERREELATAALATINNITFYFEPTDSTHFDTLDHLVKVTCGWLSHGGLPTHEAVRALGNLTRCDRAARAAVLYGALDALPPLHAHDDEEVRSASAGVLVNVCGVSVGGGVQEAGGAAARALAAAARMKDVRSGALLARAVWNALEQRPLDLHNARMAAAALATFIGNTIHMFGLVGHVTPWRNGSASDSRSEGCVFKSRREDESLFAMCEAAKCEERRASDPDIMKNHSVKLGLEGRGYHREHVESKFSLSVEEDLHLEEEFEEEGERWSGSDLGFEEGSPEPCSCGRCARGSSWRALADVALPLLQRLLPARRDAAVGTD-