Monarch geneset OGS2.0

DPOGS214815
TranscriptDPOGS214815-TA3459 bp
ProteinDPOGS214815-PA1152 aa
Genomic positionDPSCF300059 + 654092-674380
RNAseq coverage191x (Rank: top 48%)
Annotation
HeliconiusHMEL0063360.093.35% 
BombyxBGIBMGA012049-TA2e-7086.90% 
DrosophilaLar-PE0.068.56% 
EBI UniRef50UniRef50_P166210.069.06%Tyrosine-protein phosphatase Lar n=49 Tax=Coelomata RepID=LAR_DROME
NCBI RefSeqXP_971078.20.072.53%PREDICTED: similar to receptor tyrosine phosphatase type r2a [Tribolium castaneum]
NCBI nr blastpgi|1892351100.072.53%PREDICTED: similar to receptor tyrosine phosphatase type r2a [Tribolium castaneum]
NCBI nr blastxgi|1892351100.072.53%PREDICTED: similar to receptor tyrosine phosphatase type r2a [Tribolium castaneum]
Group
Gene OntologyGO:00055151e-19protein binding
KEGG pathway 
InterPro domain[516-653] IPR0089577.3e-29Fibronectin type III domain
[429-529] IPR0137838.8e-28Immunoglobulin-like fold
[242-327] IPR0039611e-19Fibronectin, type III
[64-143] IPR0035991.1e-10Immunoglobulin subtype
[70-132] IPR0035984.1e-07Immunoglobulin subtype 2
Orthology groupMCL10719 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214815-TA
ATGGGTGTCGTTATTTCGCTGCAGCGCTTTATTATCGGACAGAAAGTAGGCTGCGGTGAGAATTTCGCAATATATTCGCACGAGATAGCGCCTACGTTAACTATCTCGCTTCAGATAGTCGTGATTGAGGACGACAGATACGTGGACTGGCTTCTTCGTCGAGTGGCGCCCCAGTTCTCTATTCCGCCACCCCCGAGGACAGAGGTGATGCTGGGAGGGAATCTCACGCTCAAGTGCGTGGCCTTCGGATCACCGATGCCAACCGTCAAGTGGAGAAAAGGTTTGACAAAATGGCTTACACCGGAAGATAATCCTCCATTGGGCCTGAACACGCTCCAGCTAGAGGACATCAGGGAGTCAGCGAATTACACCTGCGAGGCAGCCAGCGTTTTGGGCGTTATAGAAACAACGGCTGAAGTGAAAGTACAATCACTACCGGGACCTCCGGCTGATGTGAGAGCATCCGAAATAACTGCTACCACAGTAAGACTCGCGTGGACCTACACCGGTCCCGAGGAGCCACAGTATTACGTCATACAATACAAACCAAAATACGCTAATCAGGCATTCAGCGAGATATCAGGTGTTGTGACGCAGTTCTACGCCGTGACGAATCTATCGCCGTACACTGAATACGAAATGTATGTGATAGCAGTTAATAACATAGGACGTGGACCTCCTAGCACACCGGCGGTTATAACGACAGGGGAAACAGAACCCGGCTCAGCCCCTCGTAATGTCCAAGTGCGTCCATTGAGCTCCAGTACAATGGTTATACAATGGGATGAACCTGAAACGCCAAACGGTCAGGTCACGGGCTACAAAATATATTACACTTCCGATTCATCGCAGTCATTACAGTCGTGGCATTCACAAATGGTGGACAATAACCACCTAACAACCATCAGTGAGCTGACTCCTCATACTGTGTATACCATAAGAGTTCAGGCCTTTACATCGGTAGGCCCCGGTCCTATATCCGCGCCCGTGCAAGTGAAAACACAGCAAGGCGTTCCCTCTCAACCTTCGAACCTGGTAGCAGTGGAGGCAGGAGAAACGTCAGTGACGTTATCCTGGAAGAGGCCAGCTCATGCTGGTGATAATATAGTATCATACGAACTGTATTGGAACGATACGTATGCCAAGCAACACCATAGGAAACGGATACCCATAACAGAGACATACACGCTGACCGGACTGTATCCGAACACGTTATATTACATCTGGCTAGCGGCGAGGTCTCAGCGAGGCGAGGGCGCTACCACACCAGCTATCGGAGTCCGCACCAAACAATACGTTCCTGGAGCACCTCCGATGAACGTGACAGCCACGGCCGTGTCTCCGACCGCGGTCAGAGTGTCCTGGCAGCCGCCGCCGGCCGAAAGGGCGAACGGAAGAATAGCGTATTACAAGCTGCTTTGCGTGGAATCGGGGAGGGGGGATTCCGAAGCCACGGTCGTGAGGCTGAATCAGACCAGCTTCGTGTTGGATGAACTGAGAAGATGGACTGAATACCGCATCTGGGTGATGGCCGGCACCAGCGTCGGTGACGGACCGCCCTCCTACCCCGTCACAATCCGAACGCATGAAGACGTGCCCGGAGAGCCCCAAGAAGTTAAAGTGACCGCAATCAACTCCACATCCATACACGTAACGTGGAAGCCGCCACAAGAAAAAGAAAAGAATGGAATCATTAGGGGGTACCATGTACATGTCCAGGAATTGAGGGAAGAGGGCAAGGGCCTCCTCAACGATCCAATGCGTTTCAACGTGATGGATGATACGACACTCGAGTTGAACATATCAGGACTACAGCCGGACACGCGGTACAGTGTACAGGTTGCCGCTCTCACGAGAAAAGGTGATGGGGATCGAAGTCCGCCTGTTACCGTCAAAACACCCGGCGGTGTTCCAAACAGACCAACAGTAAATTTAAAAATATTGGAAAGAGATCCTATTGTATCGATTGAGATTGAGTGGGCGAAACCAACACAGACATACGGTGACCTGCTAGGCTATAGGCTTAGATATGGTATTAAAGATCAATTACTTGAAGAGATTAACTTTCCTGGAACAAAGGTTAACTCGCATAGGATTAACGATTTGGAACGTGGGGTGCAGTATGAATTCAGAGTGGCGGGCAGGAATCAAATAGGTATTGGACAGGAAACAATTAAGTATTGGCTAACACCTGAGGGTGCGCCGAAAGGGCCTCCTGCTAACGTCACTTATCATTTTCAAACACCAGATGTAATTAGCATTACTTGGGATCCACCCACAAGAGCCGATAGAAGCGGTCAAATAAAAAAATATGACGTCCAATTTTATAAAAGGGGAGATCAGTCTTCATTAGTTGAAAAAACAACGGAGCTAATGAAAGCAGTGTTTACTGGGTTAGAAGAAGATGCTCATTACGTGTTCAAAGTACGGGCGTACACAGATCAAGGTGCCGGGCCTTATAGTAAAGATGTTACGGCTCACACTGAGAGGGACATCGGTAGAGCTCCCATGTCAGTTAAAGCCGTTGCGACCTCCGAATCAAGTGTCGAGGTTTGGTGGGAACCAGTGCCTTCGAGGAAGAAGATCATTGGCTATGTTATATTCTATACGATGACCCCTGTCGAGGATTTGGATGACTGGCAGCACAAAACTGTTCATGTCACACACTCTGCCGAGTTGGGGAACTTGGAGAAGTTTGCTGAATACGCAATCGCAGTGGCAGCTAAAACTGCAGAGGGACTGGGTAGGTTATCTGAAAAGGTCACTGTGAAAGTAAGGCCCGAAGAAGTGCCTTTACATCTTAGAGCACAGGACGTATCAACTCATTCCATGACATTATCGTGGTCCCCTCCATTACGCTTAAATCCAGTTAGTTATAAGATTTCCTATAACGCCATCAAAGAATTTGTTGACTCACTAGGAATGACGCAAACACAAGAAATTCCGAAGAGAGAAATAGTTGTTAAGCACGATAGGACCTCATACTCGATTAACGATTTATCGCCTTTCACTACTTATAATGTAAACATAAGTGCCATACCTAACGATAACTCTTATAGACCGCCAACGACAATTACCGTAACTACTCAGATGGCTGCCCCCAAACCGATGGTGAAACCTGATTTTTACGGTGTTGTTGAAAATGAAATACTCGTTATTCTACCTCAAGCGTCTGAAGAATACGGACCGATATCACATTATTATCTAGTTGTAGTGCCCGATGATAAGTCGCATAATCATAAGAACCCAGATCAGTTCTTAACAGACGATTTAATAAAGAATAATGCTCGCACGGACGACGAAAACGCGCCATATATAGCAGCAAAGTTTTTACAAAGAAACATTCTATACACGTTCCATCTAGGAAACGATGATATGTATGAAGGATTTTTGAATAGAAAATTGAATTTAAATAAGAAATACAGAGTTTTTGTAAGAGCGGTTGTTGATACTCCATAG

Protein sequence:

>DPOGS214815-PA
MGVVISLQRFIIGQKVGCGENFAIYSHEIAPTLTISLQIVVIEDDRYVDWLLRRVAPQFSIPPPPRTEVMLGGNLTLKCVAFGSPMPTVKWRKGLTKWLTPEDNPPLGLNTLQLEDIRESANYTCEAASVLGVIETTAEVKVQSLPGPPADVRASEITATTVRLAWTYTGPEEPQYYVIQYKPKYANQAFSEISGVVTQFYAVTNLSPYTEYEMYVIAVNNIGRGPPSTPAVITTGETEPGSAPRNVQVRPLSSSTMVIQWDEPETPNGQVTGYKIYYTSDSSQSLQSWHSQMVDNNHLTTISELTPHTVYTIRVQAFTSVGPGPISAPVQVKTQQGVPSQPSNLVAVEAGETSVTLSWKRPAHAGDNIVSYELYWNDTYAKQHHRKRIPITETYTLTGLYPNTLYYIWLAARSQRGEGATTPAIGVRTKQYVPGAPPMNVTATAVSPTAVRVSWQPPPAERANGRIAYYKLLCVESGRGDSEATVVRLNQTSFVLDELRRWTEYRIWVMAGTSVGDGPPSYPVTIRTHEDVPGEPQEVKVTAINSTSIHVTWKPPQEKEKNGIIRGYHVHVQELREEGKGLLNDPMRFNVMDDTTLELNISGLQPDTRYSVQVAALTRKGDGDRSPPVTVKTPGGVPNRPTVNLKILERDPIVSIEIEWAKPTQTYGDLLGYRLRYGIKDQLLEEINFPGTKVNSHRINDLERGVQYEFRVAGRNQIGIGQETIKYWLTPEGAPKGPPANVTYHFQTPDVISITWDPPTRADRSGQIKKYDVQFYKRGDQSSLVEKTTELMKAVFTGLEEDAHYVFKVRAYTDQGAGPYSKDVTAHTERDIGRAPMSVKAVATSESSVEVWWEPVPSRKKIIGYVIFYTMTPVEDLDDWQHKTVHVTHSAELGNLEKFAEYAIAVAAKTAEGLGRLSEKVTVKVRPEEVPLHLRAQDVSTHSMTLSWSPPLRLNPVSYKISYNAIKEFVDSLGMTQTQEIPKREIVVKHDRTSYSINDLSPFTTYNVNISAIPNDNSYRPPTTITVTTQMAAPKPMVKPDFYGVVENEILVILPQASEEYGPISHYYLVVVPDDKSHNHKNPDQFLTDDLIKNNARTDDENAPYIAAKFLQRNILYTFHLGNDDMYEGFLNRKLNLNKKYRVFVRAVVDTP-