Monarch geneset OGS2.0

DPOGS211591
TranscriptDPOGS211591-TA4674 bp
ProteinDPOGS211591-PA1557 aa
Genomic positionDPSCF300084 + 369056-379042
RNAseq coverage1362x (Rank: top 9%)
Annotation
HeliconiusHMEL0169890.082.75% 
BombyxBGIBMGA006365-TA0.078.31% 
DrosophilaCG7029-PA1e-2728.78% 
EBI UniRef50UniRef50_D7EJC90.039.03%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7EJC9_TRICA
NCBI RefSeqXP_972733.10.039.03%PREDICTED: similar to CG7029 CG7029-PC [Tribolium castaneum]
NCBI nr blastpgi|910942110.039.03%PREDICTED: similar to CG7029 CG7029-PC [Tribolium castaneum]
NCBI nr blastxgi|2700162080.038.73%hypothetical protein TcasGA2_TC016256 [Tribolium castaneum]
Group
Gene OntologyGO:00055152.3e-07protein binding
KEGG pathway 
InterPro domain[1468-1555] IPR0014522.3e-07Src homology-3 domain
[1061-1150] IPR0040122.1e-06RUN
Orthology groupMCL17804 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211591-TA
ATGGGGGTGACGGAATACGATTGGGATACTTGCAATGGCTACTATTCAGGCGAAGAGTCCGTCGGTTCCATCAACACGATGTCTATATGCAAGTCGGAGCTGTTATTTTCACCAGTGAAGGAAGGTGCCCACGGAGTCCACTTCAGCGTGGATAGTCTAGACTGTGAACTGCCCACAGAACAAGATCTCATACTCACATGCCAGGCTAATAAAGATAACTATACCATCGCCTTCGAGGGCAGTCTCACCATTTACTCTGAAGACAGTGAGTGTGCCGAACCCGCCGTCAATCAAAAACATGACAAACTGGTTAAGGAGGAATTAGTTATAGGTTTAAATAGTGATGAAAGAACGCGCAGGAATTTAGAATTATTAGAGAGATGTAAAAATTTAACCAATAAGCTATCAACATCTATGGCTAGAAGCGATTTAGGATTAACAACGTGGAGTAAACTTAAGAAACAAACCAATCAGTCGCCTTTACAGAGACATCCATCGGGAAACAACAACGAAGACTCCCACGACTCCACCCAAGCCACTAATGATATGAATAATTCGGTTATAAAAAGTCAAAGCTTACCAAACTTGTATAGAAGGAAACTAATGAGTAGTTCTATTAATTCCGCGGCATTAAGCAATTCAACCGTCGACTCCTTCTCAGCCAACCAGAGGTTAATTGGAACTTCAACATGTATGAAAGTATATGACGTCTCACAAAATCGTCAATCACATACAAGTCAACATTCAGAACCTATGAGTACATCGTCTACTGAAAACCAGACCTCAACGGATAAGAGTCAAACGAAACAACAGTCCTTCAACTTGGTTAAACTATTTATGAAACAAAAGAGCATCAGCAACGATGGTATTGTTGGTATGGATCAAATGGATAGGTCAGAGTGTTGGCCATCTAGCTCGGGAGGGGAGAGCGGGGAGTCTATGGGGGAACAGAGACTCTTAAAGACAACGATCTCCGAAAGACCCCAAGTGGAACTGCCCGCTGAGGCCCTTGAAGATGATACAGCATCAGTTGTATACGATGAAATTAGACCAACGAATCCATTTAATAGAGTGTACGACGAAGTACTTATTGAAGAAGAAGAGAATGAAGGTTCTAAGTCGAGTGATAATGAAGCAAATCTTTACGCTATAGTCAATAAACCACTCAGGAAGAGAACAAACTTAAACGTTATGAATAGTCCGAAACACAGACCGGCCAGAAAGTCGTGTTCATCTCAATCATCAGCTACCAGTGTTAGTTTATCAAGTTGCTCTGAGTCAGATGGAACACAAATCACGAAAATGAACAAAATAAAAGAAAAGGAGTCCTTTGACTCCAAAGCAACATCCACACATCTAGATACCGAAACCATAGATAAAGGCATGCAGACGTCCAGCATGCAAGTTTCGACTGCCTCTCATGATCGGGAAATATTTAAAGTGGTCGAGGCTTCATTTCTAGAAAAATTAAAGGAGGGTGATTGTGAGAAACCAGTCTTTGTACTTTATCCTAATTACACCCTGCCTGATATAAGCTTTCTAAATGGTAGACCAAATATATACTTGAATCCTGTTAAAGTCAATGTTTCGCCGAAATCAGATAAAAAGGGTAGAATGCAAGCCAAAGGCAAACGCCCCTTTTCATGCAACGATGTGGAAATGCTTAAGAAAAAAGGTTTGGGACATATAAGGGACTGGGACTCTCTAAACTTCCTTTTACCTATGGAATGTAGACAGCTGCTTTCAGAAGTTCCAGAATTAATGCAACATATGAAAGAGAAAGAAAATAATCGACCTACGAGTTGCGATTGCAACAATCTGACTGGCAATACAGCTGTCTCGTCCAGTTCAAGCACGGCGACTCAGCCTTCATCTGGATATCGAGGATCTTCAACCATATTAACCGATTCTTCGGCTCAAAACAGCCCCGCACCCACTGGAAACTTCAATCCTCTATTCGTGTATCGATACGACAGTGCTACGAGCTCAGAAGCTAGCTGTGCAAATAACGAAGGCCAAAGAATGAACAAAAGACAGTTATCGCTAGCAGATCAAACTAGAATTTTGAAACAAGGAGAACTCGTCCCTCCCAGGCCGCCTCTACCAAAAAGTATTTTACGTAAATCCATGGATAAAACTCGCAAAACCAGTGCCCATACGAAACGTTACAGCATGTTCGAATTAGATGAATTCATACAAGACCCAGTGGTATGCGCCCAAGCAGGCCTAGAACAGAAAAGTAAAAGAAGATCACTACAAGAACCATATTATCTTCAAAATCAACCTACTTTAGACTACAGAAAGAACAATGACTTAGCCGCTAAGAGATTGTCCCAACAATTTTTGGACGCCGCAGACAAAGATGCCGATTACAACGAGTACTACCAAGATGAAGGGGTGGGAACTGAAAGCAGTTTGGAATCTGGAAAATCCAATGAACTCAAGTTTCATCGCCCTCATACGCCGCCAATGCCAAAACCTAGAACAAAGAAAATCGAATACATAGATTTCCCTCCACCCGGAGCACTCATCAGCAGTGCGGACCTACAGCAGTTGGAAGACTTTCTGAAATTAAGTGGCATTAACTGTCAAAACATGGACGAGTGGGACCAAAATCAAGTTCAAAAGGTAAGAAGTCAGGTTACCAAGTTCCTACAAATGAAACGTTCCCAGGAGGAGAATCAAAGGTCTACAGACTCCAGTACGAGTAGTTGTAACAGCAAGAAATCTGTTAGTTTCGCTCAAAAGCAGGACGGTAAAGAGATTCAAACACAAACAAAGGCTGTCGAGGAACATAAAGTATCGACACTAGCCACGCCACCCAACTCTCCAAACGTATCAGCTATGGTAGCTCAAAGGCACTATCAGGGCAAAAACTTGGCTGAAATACCTATCTGCGAAGAGGGCGAAGTTAGTCCCGATGAATTCCTCAGCCCAACTCATCACGAGACCCGTCTGAAGTATGACATCATCGACGTGTCACAGAAACGAGCTTTAGTGTCAAACGTAACCGACGCTGTCGAGATGTTGATCCAGCACTTCTCGTCGGCGATGGACCAAGCGGAGTTAGCGTATTTAGGAGATTCAAAACAATCTCCCGCCTGTGCTAAAATAGCATTAAACGCGTTGTGTCCAGCCCTGTACGCCATATTCCGAGACGGGCCCATAACGAAGTCATTAAATGAACTGGTTTTGAGAATAAACAGCGAAGATGCAGTAACCGAGGGTCTTGTGAAGTTCAACGCCTTTATACTAGGATTATTAAATGCACAGTCAGTCGACGCTTGGGTATCGTACGTCCGAACGAGAGAATCAATTTTGGCAAAACACTACGATCAAGATTCTCTGATACTAGCGGGTTGTATTGGAGAGCCGCGCTGTAGGGCCTTACTGGACACACTCCTCGCCAGCCTCGAGCCGCTCAAACTGTTGCCGTTCTCCCTTGACCTAATGTTCGAGATGAGGGAACTTCATAGGAGCTTTAAAAAGATTGAAAGCGACATGCGAGCCGCCAGTCGGCCAACCTCGTTTAACACACCGCAACTAACAACCAATCAGAGAAATCTGCTCAAGTTAGTGCGTTCGATGCAATCCAGTGTCGTGTCCAGCGACGAATGTCAGGCTAGCGTCATCATGAGACAGAAGGAGCCCAAGAACAAGGAACCGTCCACCCCGGACCTGCTGAACGATTCGGCTAACGTCAAAACTACCCTCGATAAGAATAGACCGAGGTCGTGCGTCAACCCCACGCCCGTAGGCTACGACATGTGTCCAAACAACAGTAGGATAGACTTGGAGAACAACAGGAGGTGGTCCGGGGTTCAGTTGGGATCCAAACTGATGCAGGCCTTCGACAGGCTGGTGTTCGACGATAGCGATGATTACACAGACAGTCTAGAACATAAGACCTGCGCTAAAACGACCGACGCCAAGCTGGATACGAGCGGGGAAGAACAATGGCGGCCCCCTTCAGCCAGCAGTAGCGCCAGTGGTCAAGGATCCGGCGGCAAGTTCCGGAGACTTCAGCTCAAGTGGGAAATGCTCAGTAACGCTGAAAGCCCCGGGACGCCATCTGGTGAAACCTCCCCCGCGGCTGCTCGGGGTTCCAAGATCCCCCGACCCGTCTCATCCCCGGTCAGACCTCAAGCTCCTCCAATACAATCACCAGCCAAAAACACGCAACACAGAGGTATACCAGTGGCTGTACGTAAAGGTTCTTCCCCGACGACAGCAGCGCCTCGAGCGGCTAACACGAGAAACAAAAAACCACCACAGCCAGCTAACCGGGTTATACCCGAGAAGGCGGCAGGAAGACCGGAAATAAGAAAAAATGTAGCTGTTAAAAACACTATCGGTGCTAAGAACACGCTCGCGTCCCGTGTGGACGGCATCACTACGGGGAGTCCCCGTCCGTCGTCTCTGCCTTATGGACGCGCCGCGCCCCCGCCCGCCCCGCGCCGCGCCGCCTCCTCCTCCGCGACGAGAGCGACACACATCAAACACAAATACGTCAGAACTCTATGGCACAGACTGCCGTCCGACTCGGGACACCTGGCGTTCAACGAGGGTGAGAGGCTGCGGCTCATACTGGAGGTGGACGCACAGTACTTGTTGTGTTGTAGAGGAGAACAGAAAGGCCTCGTTCCGCGGGACGCCGTGCTCGAGGACTTCTGA

Protein sequence:

>DPOGS211591-PA
MGVTEYDWDTCNGYYSGEESVGSINTMSICKSELLFSPVKEGAHGVHFSVDSLDCELPTEQDLILTCQANKDNYTIAFEGSLTIYSEDSECAEPAVNQKHDKLVKEELVIGLNSDERTRRNLELLERCKNLTNKLSTSMARSDLGLTTWSKLKKQTNQSPLQRHPSGNNNEDSHDSTQATNDMNNSVIKSQSLPNLYRRKLMSSSINSAALSNSTVDSFSANQRLIGTSTCMKVYDVSQNRQSHTSQHSEPMSTSSTENQTSTDKSQTKQQSFNLVKLFMKQKSISNDGIVGMDQMDRSECWPSSSGGESGESMGEQRLLKTTISERPQVELPAEALEDDTASVVYDEIRPTNPFNRVYDEVLIEEEENEGSKSSDNEANLYAIVNKPLRKRTNLNVMNSPKHRPARKSCSSQSSATSVSLSSCSESDGTQITKMNKIKEKESFDSKATSTHLDTETIDKGMQTSSMQVSTASHDREIFKVVEASFLEKLKEGDCEKPVFVLYPNYTLPDISFLNGRPNIYLNPVKVNVSPKSDKKGRMQAKGKRPFSCNDVEMLKKKGLGHIRDWDSLNFLLPMECRQLLSEVPELMQHMKEKENNRPTSCDCNNLTGNTAVSSSSSTATQPSSGYRGSSTILTDSSAQNSPAPTGNFNPLFVYRYDSATSSEASCANNEGQRMNKRQLSLADQTRILKQGELVPPRPPLPKSILRKSMDKTRKTSAHTKRYSMFELDEFIQDPVVCAQAGLEQKSKRRSLQEPYYLQNQPTLDYRKNNDLAAKRLSQQFLDAADKDADYNEYYQDEGVGTESSLESGKSNELKFHRPHTPPMPKPRTKKIEYIDFPPPGALISSADLQQLEDFLKLSGINCQNMDEWDQNQVQKVRSQVTKFLQMKRSQEENQRSTDSSTSSCNSKKSVSFAQKQDGKEIQTQTKAVEEHKVSTLATPPNSPNVSAMVAQRHYQGKNLAEIPICEEGEVSPDEFLSPTHHETRLKYDIIDVSQKRALVSNVTDAVEMLIQHFSSAMDQAELAYLGDSKQSPACAKIALNALCPALYAIFRDGPITKSLNELVLRINSEDAVTEGLVKFNAFILGLLNAQSVDAWVSYVRTRESILAKHYDQDSLILAGCIGEPRCRALLDTLLASLEPLKLLPFSLDLMFEMRELHRSFKKIESDMRAASRPTSFNTPQLTTNQRNLLKLVRSMQSSVVSSDECQASVIMRQKEPKNKEPSTPDLLNDSANVKTTLDKNRPRSCVNPTPVGYDMCPNNSRIDLENNRRWSGVQLGSKLMQAFDRLVFDDSDDYTDSLEHKTCAKTTDAKLDTSGEEQWRPPSASSSASGQGSGGKFRRLQLKWEMLSNAESPGTPSGETSPAAARGSKIPRPVSSPVRPQAPPIQSPAKNTQHRGIPVAVRKGSSPTTAAPRAANTRNKKPPQPANRVIPEKAAGRPEIRKNVAVKNTIGAKNTLASRVDGITTGSPRPSSLPYGRAAPPPAPRRAASSSATRATHIKHKYVRTLWHRLPSDSGHLAFNEGERLRLILEVDAQYLLCCRGEQKGLVPRDAVLEDF-