Monarch geneset OGS2.0

DPOGS200349
TranscriptDPOGS200349-TA2961 bp
ProteinDPOGS200349-PA986 aa
Genomic positionDPSCF300026 + 553787-556747
RNAseq coverage362x (Rank: top 33%)
Annotation
HeliconiusHMEL0000370.072.73% 
BombyxBGIBMGA005642-TA0.059.52% 
Drosophiladome-PA2e-3225.60% 
EBI UniRef50UniRef50_D0AB850.072.83%Putative tyrosine phosphatase n=3 Tax=Nymphalidae RepID=D0AB85_9NEOP
NCBI RefSeqXP_001807060.12e-7130.13%PREDICTED: similar to tyrosine phosphatase [Tribolium castaneum]
NCBI nr blastpgi|2613359470.072.83%putative tyrosine phosphatase [Heliconius melpomene]
NCBI nr blastxgi|2613359470.072.83%putative tyrosine phosphatase [Heliconius melpomene]
Group
Gene OntologyGO:00055157.5e-11protein binding
KEGG pathway 
InterPro domain[392-478] IPR0089572.1e-18Fibronectin type III domain
[399-478] IPR0137839.1e-16Immunoglobulin-like fold
[404-478] IPR0039617.5e-11Fibronectin, type III
Orthology groupMCL15803 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200349-TA
ATGTGGAGTATTCCTAATAACATGGCCGATTTACTGCCCTGTGGCGTTGATCATATAATAGAATATCAAATTGCTAAAATTGATAACACTACACATTTCCGAAGAGTCAATGCATCATTCCTACCTCCCAAAAATAAAATTTATAGATTTCAATTGACAGACCTTCCATATGCTCATAAACAGTATGAAGTAAGGGTGTTTATTAAGTCTAAAAAGGCTACAAAGAGAGAGTTATGGTCCGACTTTAGCTATGTTGTTTTCTACACTGCAAGTGAAAGACCAAAACGACCCCCTGATACTATAGCTGGTGCTTTCCATAAATCTGCGTATAGCAATAATAGAGTCATTTATGTGTATTGGAAACAGCTCGAAGAATATGAGGAGGCTGGAGCCAACTTTACTTATAAAATCCTTGTGTCTCAAGATAATAAGACACAAACTGTGTTCCCAGATAAAAACAAGAGTTTGAGTTATGTCAGACTTAACGCAACACTTAATGCATTGGACATCACAGTTTGGTCACTAAACAACAAAGGCACTTCTCTTAATAGCAGTCACTTATACATCCCTGCTGAAAAAGACACTCACTCATTAAAACTAACTTCATTTACAAAATTAGCATATGAGAATGGAACTTATGAGTTATCGTGGGTTGCCATAAAAAATATTGACAATTACACCCTTTTCTGGTGTCAACATAATGCAACACAGATATGTGTAGGAAGAATGGATTTTGCTGTTCTGAGTCCAGATAAAAGTAATCATGTTATTGATTTACCAAGAGAATATAGATATCAGTTTGCAATATCTGCAAATAATGGATCAAAAACTAGCGGAATGGTGTGGGCCAACTGCGATATTTCAAAGGATGGATTTGTTATGTATGGTTTCCCAGTTCATTTAAAATATGATGTACCAGGCAAGTCACATGTGACTCTAAGATGGCTTATGGATTGTGCTCTTCAAGATGGTATAATTACGGGGTATAATATATCCTACTGTCCCATAGTCCAAACAAGCAGTTACTGTGACAAATCTTTTAACAATAGTTACAAGTTCATTTCTAACCCTAAGCAAATGCAGGTTACTATAGAAAATTTATTTCCATTCCGAACATACCAGTTCACTATAGCCTTGAATACTATTTATGGACAAAAAACTATAGAGAATGCTACAGCTGTTATAACAACTTCAGAAGATACTCCTACAAACCCTGTAAATATATCAGTATCTAACATAAGAAACACCTCACTTGTTATTTCATGGGATCCACCAATACACAAGAACGGAAATATAGGAAAATATGTCATTTATAATTATGACAAGGAGTTGTATGTTGACAGTGTGTCTGGAACTGACACATCCCGGAGACAAGTGACAATTACTGGACTGCAGGGTTTTACTAAGTACTCCTTAACTGTGCAAGCCTGTAACATTCCAATAGGTTTATGTTCAAAAATAAGATCTAATGACTCTGTAAACGTAACCACAAGGATAGGTGCTCCAAGTAGACTAAGAGCACCAACTGTAAGGAACAGTCCAAATAAACTTCAATGGGAACCCCCCGAAATTCCCGGAGGAAGGGTGGATCTGTATGAAATAAGAAGGATCAAAGATGATCTTGCACCGGAAATTATTAACACTACAGAATTGTCATATCCCCTTGAGTATTGTGTAGGAGTAGTATCAACAGAGACATACCAAGTAAGGGCGGTTAATTTCGATGTGACCTCATCTAGAAAAGGTATTAATGAACCTCCAAAAGAATATGTAGGACCATGGAGTGAATCAAGTGTTGTTGCTTGTAGGACTAGAGATGGTTTGACAATGATATTAATTATCATGGCAGTTCTCTCACTAACAGCGATAGTTGTTTACGGTTCTATAAAGCAATACAAAAAGTATCGGAAAATGGAGGATATCAAACCAGTTTTACCAAGTGGTTTAGGTATTCCTGAAAAAGATATATCCAAGTACACTTTCGGTAATTGGAATCCCACAACTAAGGAAGAGAAGCCGTCCTCTGATGAAATGTTATTATTGCCTAATTCTAAAACAACAGTTTCATCTACCGACACTAAACAAAAAGATGACAACTGTGCTTCAAGTGATCATACAGATAGCACTGCTTTGTCAGAGTCTTCCAGAGGCCCTGTTGAAAGACAAGCTTCAACATCAGAAGAAGGCTCTGAAACCTCAGACCATTTGGAGGTAGTCGCCGATAAAGGAGAAATCAGCAATGAAATCCAAGAAGAGGAATCTTCAGCATCAGATACTGAAACTTCCCCAGAAAATTCGCCATACTTTAGTGATAAAGCTTTTAAGAAAAATCCAACCAGTGGATACGTGCAACCGGTAGTGAGTACTACTTCCGGATATGTTCAGTCAGCACCAGCGCCAGTGCAAACTAAATGTCCATCTCAGAGTACTGCAACACAACCAGCCAGTAACAGTTACGTTATGGCTGGCTTGCCTCCACCAGTATTTGTTCCTAACTCAGCGACGGCTACAAATCCGCCCACATTATCTTCGGGCTACGTACTACCCGAAGATGTACAAGCCAGGTCTATGATGAATACTAATAAATTCGGACCATCGATTCCAAAATCCGTTGGACCAGAAAGTTTACCAACGATGCCTTCCTTGCCACCTGCAGCGAAACAAAGCGCCGACAATAGCTATATCCAGTTACAGTCTTTAGACTCCTTGCCTAGTCTTAAGTTAAGTGAGCGTAACTCATTCCCAAAACAACCGTCAAGCGGTTATGTAAGTCCAGGGGATGTCGTCATAAATAAACATCTGAACATTTTGACGGGCGGTCAGCACGCCGAGGAGTCTGCGATATTAGATCCAACGATGTCTCCGGATGCATATTGTCGCTTCTCCTGGAGCAATGACCCCGCCAATGATAATTTAAATACCTTACTCAGCGATTCCCCCACACGGACGTACAACAATTGA

Protein sequence:

>DPOGS200349-PA
MWSIPNNMADLLPCGVDHIIEYQIAKIDNTTHFRRVNASFLPPKNKIYRFQLTDLPYAHKQYEVRVFIKSKKATKRELWSDFSYVVFYTASERPKRPPDTIAGAFHKSAYSNNRVIYVYWKQLEEYEEAGANFTYKILVSQDNKTQTVFPDKNKSLSYVRLNATLNALDITVWSLNNKGTSLNSSHLYIPAEKDTHSLKLTSFTKLAYENGTYELSWVAIKNIDNYTLFWCQHNATQICVGRMDFAVLSPDKSNHVIDLPREYRYQFAISANNGSKTSGMVWANCDISKDGFVMYGFPVHLKYDVPGKSHVTLRWLMDCALQDGIITGYNISYCPIVQTSSYCDKSFNNSYKFISNPKQMQVTIENLFPFRTYQFTIALNTIYGQKTIENATAVITTSEDTPTNPVNISVSNIRNTSLVISWDPPIHKNGNIGKYVIYNYDKELYVDSVSGTDTSRRQVTITGLQGFTKYSLTVQACNIPIGLCSKIRSNDSVNVTTRIGAPSRLRAPTVRNSPNKLQWEPPEIPGGRVDLYEIRRIKDDLAPEIINTTELSYPLEYCVGVVSTETYQVRAVNFDVTSSRKGINEPPKEYVGPWSESSVVACRTRDGLTMILIIMAVLSLTAIVVYGSIKQYKKYRKMEDIKPVLPSGLGIPEKDISKYTFGNWNPTTKEEKPSSDEMLLLPNSKTTVSSTDTKQKDDNCASSDHTDSTALSESSRGPVERQASTSEEGSETSDHLEVVADKGEISNEIQEEESSASDTETSPENSPYFSDKAFKKNPTSGYVQPVVSTTSGYVQSAPAPVQTKCPSQSTATQPASNSYVMAGLPPPVFVPNSATATNPPTLSSGYVLPEDVQARSMMNTNKFGPSIPKSVGPESLPTMPSLPPAAKQSADNSYIQLQSLDSLPSLKLSERNSFPKQPSSGYVSPGDVVINKHLNILTGGQHAEESAILDPTMSPDAYCRFSWSNDPANDNLNTLLSDSPTRTYNN-