Monarch geneset OGS2.0

DPOGS200102
TranscriptDPOGS200102-TA2325 bp
ProteinDPOGS200102-PA774 aa
Genomic positionDPSCF300044 + 322051-332650
RNAseq coverage1769x (Rank: top 7%)
Annotation
HeliconiusHMEL0052520.087.33% 
BombyxBGIBMGA004550-TA0.081.06% 
DrosophilaCG8907-PB2e-6537.86% 
EBI UniRef50UniRef50_D7EIL40.049.93%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D7EIL4_TRICA
NCBI RefSeqXP_001808875.10.049.93%PREDICTED: similar to GA18075-PA [Tribolium castaneum]
NCBI nr blastpgi|1892419960.049.93%PREDICTED: similar to GA18075-PA [Tribolium castaneum]
NCBI nr blastxgi|1892419960.049.94%PREDICTED: similar to GA18075-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055151.4e-14protein binding
KEGG pathway 
InterPro domain[38-162] IPR0136251.4e-14Tensin phosphotyrosine-binding domain
[496-569] IPR0014521.5e-14Src homology-3 domain
Orthology groupMCL13477 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200102-TA
ATGGCGCTGCGAAACGCTGGCGGGGGGGCTCCCCGGGGTAGGGCTTCCGACGCCACTGCGACCCGAGCTCTTGACGAATTTGACACATTAGCTGGTCTGCGACGAAGTACTTATGCTCTAGAACACCTGGCTACGTTTACAGTGACAAGGGAAACTGGGATAGTATACCCCGCTGATGGCATGAGAAGATTGCTTCAACTCGAAAAGACCAATGGAATATGGAGTCAAAAGATGCAACTGTCTTTAGAAGGTCAATGGGTGCTAGTTATGGATTACGAAACGGGGTCCATCATGGAACGGTTTCCAGCGTCGTGGGTGCATTCACCAACAGCATTCACATCCCCAGAACCAGCTGAACTGTACAATAATGTGTTAGCATTTGTTGTCGAAGCCCCGGAGTCGGGAGGCTCTCCAGCGGGTGCTAGGCGTGAGCTGCACATCTTCCAATGCCACGATGTTGGTGCACAAGCTCTTGTTGAAGAACTTAATGCACTTAAGGGCGAAGGCGGCGGAGGCTCTGAAGGAGGAAGAGACTTCGTGATTGAAAGAGAAAGAGAGAGGGAAAGAGAAAATGATTTAGACAGACCAAGACGTCAGCAAATGTCCGCACAGCTCGGCCGCGGAGATCGACCAGATCGTGATCGTGGAGGGTCAGCTGGTGAGCGCGATGATGCTTCATCTACAGGCTCTGAGAGACTATATGAGCAAGACATCGCGATCCTCAATAGATGCTTCGACGACATAGAGAAGTTTATCGCCAGGCTGCAACATGCAGCGGCAGCATCAAGAGAACTTGAGAGAAGGCGGAGATCGAGAGGAGGAAAGAGGAGTGCTGGGACTGGAGAAGGAATGCTGGCACTAAGAACACGCCCTCCACCTGAAAGAGATTTTGTTGATGTGCTCCAAAAGTTTAAACTGTCTTTCAACCTTCTGGCCCGTTTGCGAGCTCACATACATGACCCTAATGCTCCAGAATTAGTGCACTTTCTCTTCACACCATTGGCTCTGATAGTAGATGCAGCACAGGATGTCGCAGACGGTCGTCTGCCAGCACGTGTGGTACAGCCACTGCTTACTCGAGATGCACTTAATCTGCTCGCCAACTGCGTAACCAGCAAAGAAACTGAGCTATGGCACTCGCTGGGTGATGCTTGGCTTATACCAAGAGAGCAATGGAAAACGACAATCCCTCCCTACCAGCCGGTGTTTATGGACGGCTGGTCCCCAGATTACCAGGTGGACGACCAACCTTTGCGACGAGCATCTCCAAGAAGAAGTGAAGCTGGTAGAGGTGGAACGGGTGCATTAGGAGAAGAAGCGATCAGGGAAGGAGCAGACAGGGCCGACGGCTACGGGTATGAGAGGGAAGAACCTGACCTATACGGTGAACAATACACACCTTACTCTAGAAATCCGCGTACTCTGACCCGAGAAGATTCTGGCTCAGCGGCTTCCTCTCCAGAACGCGAACCACCATATAGAGCAGACAGAGATGAAGATGAGCTAGGTGAAGCTTGGGCTCGTGGGGTTGCAGCCCGTGGCGGTCGTGTCGTTCGCGTCACATACCCACGCACCGCTAATAATGACAAAGAGCTGACAGTTGTTCGTGGGGAATACCTTGAGGTGCTAGATGACTCTCGCAAGTGGTGGAAGGCGCGCAATCGTCGTGGTCAAACTGCCCATGTGCCGCACACTATTGTAGCACCCGCCATCTCACCGCCTGGTTCTCCAGCACCTGATGCTCTGCTTTATCCCAATCCTATCTACACGCACTATCAAGAAGGCGGCGGCAGAGGTTCCGGCGGCAGCAGTCCAACGGGTCAACCTGAGCCGGAGCCACGAAAGCCTGAAAAGTCTGTACCCCCTCCCCCTCCTCCTCCACCTCCGCCACCAGAGCCTCCCGCACCAGCTCCAGTCTTACCAGCCAAAACCGATACAATGAAATCCACAAAATCAGCTATTAGCACCAGCAGCGGTCTGCATGAGGAGTTGAAGATGGTTCTTCCTCAGATCACACAACGACGTCTTGATATAAAGAAAACACCAGATATATTTATCCACCAGAAATCCAATCCTGAAGAGGTAGTCCAATGGCTTGAAGCTAAAGGCTTTAGTAGCACAGCCCAGAAGCAGCTGCGTGTATCAGGTCATCAATTATTCGCTCTGTCAAGAAGCCAACTGGAACGAGTGGTAGGAGCTGATGAAGGGAAGAGGTTGTACAGCCAGATACTGGTGCAAAGAAATGTATCTGGGTACAAGACGACGTCCGCCTCAGAGCTCGCCAGCATCCTGCGGAAAGTTCGCGAGAAAGTAGAAGTATCCTGA

Protein sequence:

>DPOGS200102-PA
MALRNAGGGAPRGRASDATATRALDEFDTLAGLRRSTYALEHLATFTVTRETGIVYPADGMRRLLQLEKTNGIWSQKMQLSLEGQWVLVMDYETGSIMERFPASWVHSPTAFTSPEPAELYNNVLAFVVEAPESGGSPAGARRELHIFQCHDVGAQALVEELNALKGEGGGGSEGGRDFVIERERERERENDLDRPRRQQMSAQLGRGDRPDRDRGGSAGERDDASSTGSERLYEQDIAILNRCFDDIEKFIARLQHAAAASRELERRRRSRGGKRSAGTGEGMLALRTRPPPERDFVDVLQKFKLSFNLLARLRAHIHDPNAPELVHFLFTPLALIVDAAQDVADGRLPARVVQPLLTRDALNLLANCVTSKETELWHSLGDAWLIPREQWKTTIPPYQPVFMDGWSPDYQVDDQPLRRASPRRSEAGRGGTGALGEEAIREGADRADGYGYEREEPDLYGEQYTPYSRNPRTLTREDSGSAASSPEREPPYRADRDEDELGEAWARGVAARGGRVVRVTYPRTANNDKELTVVRGEYLEVLDDSRKWWKARNRRGQTAHVPHTIVAPAISPPGSPAPDALLYPNPIYTHYQEGGGRGSGGSSPTGQPEPEPRKPEKSVPPPPPPPPPPPEPPAPAPVLPAKTDTMKSTKSAISTSSGLHEELKMVLPQITQRRLDIKKTPDIFIHQKSNPEEVVQWLEAKGFSSTAQKQLRVSGHQLFALSRSQLERVVGADEGKRLYSQILVQRNVSGYKTTSASELASILRKVREKVEVS-