Monarch geneset OGS2.0

DPOGS210232
TranscriptDPOGS210232-TA4650 bp
ProteinDPOGS210232-PA1549 aa
Genomic positionDPSCF300196 - 239806-251770
RNAseq coverage411x (Rank: top 29%)
Annotation
HeliconiusHMEL0105066e-11835.14% 
BombyxBGIBMGA002552-TA4e-8540.12% 
Drosophilal(3)76BDr-PA1e-4027.15% 
EBI UniRef50UniRef50_D6X3R84e-7827.90%Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6X3R8_TRICA
NCBI RefSeqXP_968947.23e-7927.90%PREDICTED: similar to Zinc finger protein 294 (RING finger protein 160) [Tribolium castaneum]
NCBI nr blastpgi|1892415287e-7827.90%PREDICTED: similar to Zinc finger protein 294 (RING finger protein 160) [Tribolium castaneum]
NCBI nr blastxgi|2700010123e-7827.70%hypothetical protein TcasGA2_TC011290 [Tribolium castaneum]
Group
Gene OntologyGO:00054887.7e-09binding
KEGG pathway 
InterPro domain[63-843] IPR0160247.7e-09Armadillo-type fold
Orthology groupMCL34827 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210232-TA
ATGGGTGGAAAAACAAAACAGAGCCAGAGAACTAAGAATAATGTTCGGCCCTCAAGTAGTGGCAGAAGTGCAGAAATACTTAGTAACAGTCTGAAATTGGATTCATCATACGTTATAACGGGGTCGGGGAAAGTTCTTCCGGCTTTATTTCCAACACTTACAACAACACCAATTGACCAAGGCCTTAGTCCTGAATATACAGTGTGTTTTAAAAAATTCCACAAAAAAGATCCCATCACTAAAACTAAGGCTTTACAAGAGCTCACTGAACTAGTTAAAAATGGTGATAAAGAAGAAGTGGTAGCAGCGCTACCAAGCTGGGCTAACATCTACCAAACACTCACAGTCGATGGTGATCGGAGAGTCCGAGAAACTGTACAGACATGTCACAGTGCTGTTATCTCAACGTGCGGTCGTCGCGCGGCTCCAGTACTGCGACGTATTCTCCCCGCTTGGTTGCTGGCCAGCGTGGACGACCACGGACCAGCACAGACTGCTGCTCTCAATAGTCTTCAGAACACATTCCCTGATGGAAAATTAGCTGAGGCCATATCATTCTGCAAGTCGGAAATCTTGTCATTACTAATAGACAACCTCACTGGAAATGCAGAGGCTATGCTCAGTAACAAGGTGGAGGACGTTGAAGAGCGAAGTATGCAGGTGTCCCGTCTGTTGGGCTGCAGCTTGCTGGCGCTGCGAGCCGTGCTGTTGAAACTAACGGAACACCTGCAACATATAGAACAGCTGGAGGAACAGCGGGACTGGCTGACAGGACAGCTGGAACACCTGCTCGGGAATAACGCCTTCTGGAAACTGGCCACACACGAGTCCCAACATGTCCGTGTGGCTCTATTACCGCTACCCTATATGTTGGCTCGTCTCACTAATAACCTAACTCCTCCCCCCTCCCATAGATACTCCTGGTATTCCTGTATGGGTTCGCTGTGGTCGGTGTACCCCGGGGCGCCGCCCCCGAGTCACTCCCGCCGCCTGCTGCGGGCGCTCACCAACAGGCCGGAGTCCGGCGCTTCGCCCTGGAGCGCCATGTTGCTGATAGCTAACAGTCACCCGAATTGGCACGAGTGGTTAGACAAGAAGGATCTCCTGGTTAAGAGAGTCGTGGGTGTTCTAGAGAGTGGTGGTGGAGGGGAGTGTCGTGTGTTGAGTGAGTCGTTGCGCGGCCTGCTGTCGTCACTCCCTCGCGACATGAAGACCAAAGACTTCTGGTCGACCTTCTTCGATGCCGCCTTCAAGGGTCTTAAAAACAAAAATCTACTGAGTTCGAGGAGTGAGAGACAGGGTTGGATATCAAATATAGCGGAATGTCTGAAGTATGTGAGCGAGCAGAATGACGACTGGGTCGTTGAAGTTATCACGGAAGTGCACCGGACGTGGCTGGAGCTGGTCGCTACCGCACAGGACAGGCTGACAAAGACCAATATGATCAAACATTCCGGCACGCAGATGGCGATGTTGGTCAAACACTGGCTTCAAAACTCCAAAGACTTAAGCGACGAGAAATACGACCAACTCATAAGAAACTACTGGCAGAACATCGGTTCAACGATCGCCTCGCACATCGATATCCTGTCCCTGGAACGGAGCGACATTGCAAACTCGATCGAAGCTCACACTGTAATGTTGAAAGCGTTAAAGACGGGGTACCACGAGGACGCCAGACAGCAGCGCTGCATCAAATTCGCTGACGAAGATGACAAAACGCAAAAGGAACCAATTAAAGTGGAACTAGACCGGCGCCTACAGGAGCGGTTCGAGCACAACCTGCACGACGTCGCGGACAACACGTGTTGTAGCTACTTCGATTTTGCTCAAACGAAGCAAGTCTCCAAGTGGGTGTTCCCGTCGCTCCAGCCTCTGCTGGCTGAGTTCGAGTGCGAGCGTCTGTTTCGATCTCTGGCGAGACATTTCGGTTGCCAGAACGTATTCGGTTTTTACGACAAAGTGCTCAAGACCTGGCTGTCCGGAGACACGATGAGATGTAACGCTCTGGTCGACGTGATGTTCACTCTGGCTCAGTATTTGAATGAATCAGAACTCGAAGCCATGTACGACTCCTTCCAGCAGTTTCCTCCTCAAGTGGTGGAGTGGTGTCTGTCTCCGTCCTTGTCTCTCTCGCGGGGATCGTCGTCTATTGCAGCGAGCTGGCTGCGCGGCCCGGTGTGCGAGGCGGCCGTCGTGGGGCTCAGCAAGCGACCCGAGGACCCCGCGGCCCGGCAACTGTTACTGGAATGTATGGCTGTCGATGAACGGGGAGAGCTGCGCGTGAGTGAGTCGACGGTGGTGAAGGTGCTGAGTGTGTTGTCGTCCCTAATGCCGGTGCCTCCGCCGAGCGAGGTGTCGTCCCTGGCCTCGTCCCTGGTCCTCTCTCTGCACGCTACCAACGTGTCCGACCGCTGTCAACAACTCGTCCTCGTCATGTTCGAACTGGTGCTAAAACATCCGAAATCGTCTTCTCTTCACTCCTCGCTCCTCGAATGTCTGCGCGTGTTGGGTCGCGAGTGTTCGTCGGAGGCGCTCCGCAGGGCGAGGGCCTGGCTATATCACGACATAGAGGAACTGGACATGAGTCGTATAGAGCACGTGGCGTCTCTGTGTCCGCACGTGTTGTTGCCGAGCGAGTCCGTGCCCGCAGACACCAGTGACCTGTCCGCCCTCACTCGCGCCCTGATGGACGACCAGCTCCGGCCGACCGTTCCGCTCGAGGTGTTCTCCCTGAGATGTGACTGCATCTTCGGGAACATCAACTGTCCGATAGAAGACGACAATGACGTCGTCAAGACCATCGTTGCCACCTCCGACATGGACCACGCGGAACTTTCCAAAATGGACCTTATGGTGCATGTGTACAAGAGTCTGTTCCGCTCGTTCTTCATCCAAGCCATCGTCACGCGGCACGCGGACATCATGGCGGACGTGTTCTTCCGGGAACAATACGCGCTCTGCCTGTACGAATACGTCATAATCAAGACTCTGTATGACAGATACGCCTTCTGGTGCCACTATGAAATTATATACGAGACCAAGAACCGCATGGACCTCGTGCTGGACGACATATTCACCAAGACGCCGTACAAACACAAGGCGTCTCTGTTGATATATCTGTCGGAGCAAGCGGCCGCCAAGGGCTACTACTGGTCGTACGCGACCAGGCTGTTCGACGACAAGGTACAGGAGTACGTCAAGATGAACCCCGACAATGAACCCATCTGTGAAGTCGAGAAGGAGGTCTGCGTGGAGGCGGACCTCTCCAAGCAGGTTCTGGACTCCATGTGTATAGAGAAGTTGAAGGACATCATGACCGGCAACGGTTTCTTCCACAGCTTACAGGCGGAGCGCGGCGGGGTCCAGCGTACTCGTTACATGGTGATGTTGCGGAGTGTGTTCGCGGCTCACAGTCACGAGCCGGATGTGCTGCGGGCCGCCCTGTCCCCCGGCTGGGGCTCCAAACACACCCCCGTCGACCTGTACTACACACACAACCACGTCATGCTGTATGAGAGAGAGCTGATCGCGGCGCCCTGGAGCCAAATAATAAGCAACGCGGCCATCGTGGACTTCCTCATAGAGTCCGTGGAGAGAGGCTGGGAAATGTCCGCCGCGGAGTGGGACTTCACCACCATCACGCTCTGCTCGCTCATCACCTCGCTCAGGAACAGCGCCGAGGGTTGGGAGGCCACTAAGGTGTCCGCCCTGGCCCGTCCCGTGCTCCAGCTCCTGTCGTGCGTGTCGTCGTTCATCAGGGAGCTCCCTCGCCGCTGCGTGCTGGAGCAGCCCGCGCCGCACGTGGCCGCGCTCGTCACCGAGTGGAAGGACATCTTCGCACCGGACATCAACAGGAACCTGTTCGAGATGCTCGTCATCGTACTCAAGTCAGCGGACGAGCACATGACGTCCAGTAGAGTGGCGACCATCGCCGGCCTCATCACGGCCACCAAGCACATGGACTACCAGCACGTGAAGACGAGGAGCACCGACACCGAGCTCAGGAACATAGCGAGCGTCGCCATACGAGTGCTGGACAGGATGACGCACCACGCCTACAAGTACCTGGCCTTCCACACGCTCGACCTCATCTCCAAGCACATGGTGCTGGACGACGCTGAAAAGTTATCCGAGTGGAGCGCCCGGTCGGACGACAGTCCCCGGCCCGAGTTCTCCCTGTCCTATTACGACGACACGCTGATCCGGCTACAGGAGATGGTGGACGCCGCGCTCGAACACGTCGACGGTGTCTGTGGGTCCCGCGTGGTGTGGTTGTCTGCGGGGGGGCGCGTGTCCTGCTCCCTTTTGCTGCTGGCCGCGGCCGAGTTGAGACACGCGGCCGCCGCCCGGACCGACCTCGCGCACATGTACATAGAACTGTTCAGGGAGAACAAGTACGCGGAGTGGTGGATGCAGACGACCCTCAAACTGCTGCCTCAGGAAGTGGCGGCCTACGCCCTCGAGGAGACCGACACGCTGCCCGACCACTGTCTCAAGGACTTTGACATTCTGCCCGAACTAAACGTCTACGGTTGGTGTAACGGCCGCACGGTGACCCGCCTGTCGTGCTGGGTGTTGACGGCCACCTTGGGCGGATCGGGGGCGGGCGGTCGCGCGCCCTGCAGCGCCTGGTGCCGGCCGCGGTGA

Protein sequence:

>DPOGS210232-PA
MGGKTKQSQRTKNNVRPSSSGRSAEILSNSLKLDSSYVITGSGKVLPALFPTLTTTPIDQGLSPEYTVCFKKFHKKDPITKTKALQELTELVKNGDKEEVVAALPSWANIYQTLTVDGDRRVRETVQTCHSAVISTCGRRAAPVLRRILPAWLLASVDDHGPAQTAALNSLQNTFPDGKLAEAISFCKSEILSLLIDNLTGNAEAMLSNKVEDVEERSMQVSRLLGCSLLALRAVLLKLTEHLQHIEQLEEQRDWLTGQLEHLLGNNAFWKLATHESQHVRVALLPLPYMLARLTNNLTPPPSHRYSWYSCMGSLWSVYPGAPPPSHSRRLLRALTNRPESGASPWSAMLLIANSHPNWHEWLDKKDLLVKRVVGVLESGGGGECRVLSESLRGLLSSLPRDMKTKDFWSTFFDAAFKGLKNKNLLSSRSERQGWISNIAECLKYVSEQNDDWVVEVITEVHRTWLELVATAQDRLTKTNMIKHSGTQMAMLVKHWLQNSKDLSDEKYDQLIRNYWQNIGSTIASHIDILSLERSDIANSIEAHTVMLKALKTGYHEDARQQRCIKFADEDDKTQKEPIKVELDRRLQERFEHNLHDVADNTCCSYFDFAQTKQVSKWVFPSLQPLLAEFECERLFRSLARHFGCQNVFGFYDKVLKTWLSGDTMRCNALVDVMFTLAQYLNESELEAMYDSFQQFPPQVVEWCLSPSLSLSRGSSSIAASWLRGPVCEAAVVGLSKRPEDPAARQLLLECMAVDERGELRVSESTVVKVLSVLSSLMPVPPPSEVSSLASSLVLSLHATNVSDRCQQLVLVMFELVLKHPKSSSLHSSLLECLRVLGRECSSEALRRARAWLYHDIEELDMSRIEHVASLCPHVLLPSESVPADTSDLSALTRALMDDQLRPTVPLEVFSLRCDCIFGNINCPIEDDNDVVKTIVATSDMDHAELSKMDLMVHVYKSLFRSFFIQAIVTRHADIMADVFFREQYALCLYEYVIIKTLYDRYAFWCHYEIIYETKNRMDLVLDDIFTKTPYKHKASLLIYLSEQAAAKGYYWSYATRLFDDKVQEYVKMNPDNEPICEVEKEVCVEADLSKQVLDSMCIEKLKDIMTGNGFFHSLQAERGGVQRTRYMVMLRSVFAAHSHEPDVLRAALSPGWGSKHTPVDLYYTHNHVMLYERELIAAPWSQIISNAAIVDFLIESVERGWEMSAAEWDFTTITLCSLITSLRNSAEGWEATKVSALARPVLQLLSCVSSFIRELPRRCVLEQPAPHVAALVTEWKDIFAPDINRNLFEMLVIVLKSADEHMTSSRVATIAGLITATKHMDYQHVKTRSTDTELRNIASVAIRVLDRMTHHAYKYLAFHTLDLISKHMVLDDAEKLSEWSARSDDSPRPEFSLSYYDDTLIRLQEMVDAALEHVDGVCGSRVVWLSAGGRVSCSLLLLAAAELRHAAAARTDLAHMYIELFRENKYAEWWMQTTLKLLPQEVAAYALEETDTLPDHCLKDFDILPELNVYGWCNGRTVTRLSCWVLTATLGGSGAGGRAPCSAWCRPR-