Monarch geneset OGS2.0

DPOGS212132
TranscriptDPOGS212132-TA3318 bp
ProteinDPOGS212132-PA1105 aa
Genomic positionDPSCF300038 + 18623-30842
RNAseq coverage459x (Rank: top 27%)
Annotation
HeliconiusHMEL0038200.071.51% 
BombyxBGIBMGA006582-TA0.069.63% 
DrosophilaIA-2-PC2e-16550.29% 
EBI UniRef50UniRef50_D6WZ660.045.93%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WZ66_TRICA
NCBI RefSeqXP_974566.20.045.83%PREDICTED: similar to receptor-type tyrosine-protein phosphatase N2 [Tribolium castaneum]
NCBI nr blastpgi|2700139370.045.93%hypothetical protein TcasGA2_TC012616 [Tribolium castaneum]
NCBI nr blastxgi|2700139370.046.53%hypothetical protein TcasGA2_TC012616 [Tribolium castaneum]
Group
Gene OntologyGO:00064706.9e-107protein dephosphorylation
GO:00047256.9e-107protein tyrosine phosphatase activity
KEGG pathwaydme:Dmel_CG317952e-163 
 K07817 (PTPRN)maps-> Type I diabetes mellitus
InterPro domain[804-1067] IPR0002426.9e-107Protein-tyrosine phosphatase, receptor/non-receptor type
[962-1064] IPR0035951.2e-36Protein-tyrosine phosphatase, catalytic
[578-652] IPR0216135.6e-07Protein-tyrosine phosphatase receptor IA-2
Orthology groupMCL12075 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212132-TA
ATGGCCCAGGGCGACAAATTCGGGGGGGCGCCAAGGTCTGAAGTTGGAGTCTCTCAAGAAGTAGTAGTACAAAGAAGTACCGCGGGGAAGTGGCATCGTGTGAATGGAGGAGCGCATGATCGATACAGCGGAGTGCGCAGGTCCCACCAACTAAGTACAGCTGTAGTCTTTGCATCTTATCGTGCAAGTCGTTCTAAACGGCGTCACGCTCTCGGAAGAATTAATTATCACGGATATGTTCCGGCTTTTTTGGCGGATCTAAACGATGTAATGAGCCGTACGTGCTATCGGCAGGCTCTGTGGGCGCTGGCGGTGATATGTACGCTGGCGCCCTCGAATGCCGACGGGAATATTGGTTGTTTGTTTAGCTCGTCCTTATGCATTGACGGAGCTGAGTGGTGCTATGATGATTTCGCATTTGGAAAATGCATTCCGATCTATGACAACGATCCCGAAGAGGGGTCACTTTACCAATATGATATGAGCTCTACGCAGCTACAGTGGTTCGAGAGGGAGTTGCAACAACTAGCAGCCCAAGGCTATCGCTGGGAGCACGCATTCACGCAATGCATGCTGCAAAGTATGTTGTACGCTCTACGACATCACCTGGATCCAAATCAAGTTAACTCAAAGCTGTGTGAGCATTTCGCGGATCCCAAGCTAAGTGCGGGAGTAACAAACGTTGGAGACGAAACGTTGGATGCTAACTCTGATGAAACAGCGTACATAAGATTCGTACCAAACACTAAATTGATCGACTCAGATTATGCAAATGAAGTATATAATCCACCCTTACTTGACGATGAAGAACCTAAAGGCGATGATTCATCTATGAAAATTGAAGATTTAGAACCTGTTGAAGACACAAACGATAAAATACGAAATATAATGCTAATGAGTGGGGTCGAATCACCTGTGATTGTACCATTTGAAGGCTTTAGAGAACGCTTACAAGCTGAAGAGGAAGCACGACATCATGTACCTAAGGACAGTATCATTAATTATGATAAAGAAAAAAAATCCTTGAACGCAAATAACAATCAAGAAAAGCCAGTTAATGAACTCCCAGATGAAGAACGGTTATTGGCTCATTTTCGTAAATACAAAATAAAACCACCGCCCTTCACAGCGGAATACTTGACTGCTAACAGATTCTCACCATTGGATGAAGAAATAAGATCGAATGCTCTGGAAAAATACAAGCAAAGCTTTCTAGAAAAGAACTTCCCTTTTGAATATGAAAACCCAGAAGATCTTTCAGAAGCAAGGAGCTATGTAGAAACCCCACCTAGCGAATTAAATGGAGACGAAGGAACAACTAATGAAAAGGAAAACAATTCCAAAGAAATAAACCCAAAGAACATGGAATATTTAATGAACTATTGGCGTGAAATTGTTGGTGCAAAATTAAAACCTCAAGAAAATTTATATGCTGAAGGAGGTCCATTAAAAACGGATGAACTGCAAGGTGAAAATTCAAAATTTTATTTATCTCAAGATTTGCAAGACTTAGTTAATAGAGAATGGGGATTTAAGCGTAGGGAAAGAGATGATGTTAAAAAGCCTGGGCCGCGTGTGGACGCAAAAGCATTAAAGATTTTATACAGCAATAAATCAGTGACAGCCCAATCATCTAATCAGAATCAGATTATATCGGACCACGATCACAACGATTACGACTACGATCCATCTTACGCGTTTGTAACTTTTCACAATAGGTTTTTGACAGACTGGGAGAAAGGTATTTCATTCATAACACGTCTTGAAGAGATGTTGGGCTTAGAAAAAAATACGTTTACAAATCCCCGAGTCGATCCCAGCGAAGTCACTTTTAAAGTAGAAAAAAATAGCAAAGGCTACGATGCAGCAGATGTTGCTAAGCAAATTGACGTTATCAAGGAAAAAGTACGTAAGGACACTGGAGCACAAATACAATCGGCTGGAGTTGGAGATAGGAGCAAATATCCAATGATTCGTAACTCCGAGTCCAAGGAGAATCAACTATTTGGTTTGGATTATCCAGTACTACTAGCACTTGTGGGTAGTTTGTCAGTTCTTATCGTGGGAGCAGTGGTGTTTGCTGTTTTGTTGAAGAGGGATATGAGTGCTAGGCGGAAGATGCAGGGCTTGGCTTCAGCAGCTGAGATCGACGCTGAGGCTACAAGGGATTATCAGGAACTTTGTCGTGCTCGCATGTCCGGTAAATGGACGGGCACGCAGACCGCAGTCGCTCCTCCAACTGAACCTCCGCAAAGGATTACGTCGCTATCACGTGACCCAGACGGGAATTCACCCTCTACTAGATCAAGCACTTCATCTTGGAGTGAGGAACCGGCTTTGACTAATATGGACATTTCCACTGGACATATGGTTTTGGCTTACATGGAAGACCATCTCCGAAACAAGGATCGCCTGGAACAAGAATGGCAAGCGCTTTGCGCTTATGAAGCTGAACCATGTGCTACCGCAGCGGCCCTGAAACCTGAGAATAACGGCAAGAACCGTTGCGCCGATGTCTTGCCTTACGACCATTCTAGAGTCATACTCAACACTCTCTCCAATCACCTTGGATCTGATTATATCAACGCATCTACGATAACTGACCACGACCCACGTAACCCGGCCTACATAGCGGCAGCTGGTCCATTGGTGCAAACAGCTCCGGATTTCTGGCAAATGGTATGGGAACAAGGCAGTGTAGTCATGGTGATGTTAACCCGCCTCACTGAAAACGGACAACAGCTCTGTCATCGATATTGGCCTGAAGAGGGTTCAGAACTGTACCACATTTATGAGGTCCATCTCGTGAGCGAGCACATTTGGTGTGACGACTATTTGGTCCGAAGCTTCTATCTGAAGAACCAACGTACTGGCGAAACTCGTACTGTCACACAGTTCCACTTCCTCTCGTGGCCCGAGAATGGAGTACCAGCTTCTACCAAGGCATTGCTTGAGTTCAGAAGGAAGGTTAATAAGTCTTACCGCGGAAGATCTTGTCCGATTGTTGTCCATTGCAGTAATGGAGCCGGTCGAACCGGTACATACTGTTTGATCGACATGGTTCTCAACCGCATGGCTAAAGGTGCAAAGGAAATTGACATCGCCGCTACTTTGGAGCACATCCGCGACCAACGCACACGCACTGTCGCTACCAAACAGCAGTTTGAATTCGTACTGATGGCTGTTGCAGAAGAGGTACACGCTATACTAAAAGCCTTACCAGCCCATCTACAACAGCTGCAGGAGAAGAAGGACAAAGAGAAGGAGAAAGAAAAAGGATCAGAGAAAGAAGGCACTGATAAAGATAAACCAAACTAA

Protein sequence:

>DPOGS212132-PA
MAQGDKFGGAPRSEVGVSQEVVVQRSTAGKWHRVNGGAHDRYSGVRRSHQLSTAVVFASYRASRSKRRHALGRINYHGYVPAFLADLNDVMSRTCYRQALWALAVICTLAPSNADGNIGCLFSSSLCIDGAEWCYDDFAFGKCIPIYDNDPEEGSLYQYDMSSTQLQWFERELQQLAAQGYRWEHAFTQCMLQSMLYALRHHLDPNQVNSKLCEHFADPKLSAGVTNVGDETLDANSDETAYIRFVPNTKLIDSDYANEVYNPPLLDDEEPKGDDSSMKIEDLEPVEDTNDKIRNIMLMSGVESPVIVPFEGFRERLQAEEEARHHVPKDSIINYDKEKKSLNANNNQEKPVNELPDEERLLAHFRKYKIKPPPFTAEYLTANRFSPLDEEIRSNALEKYKQSFLEKNFPFEYENPEDLSEARSYVETPPSELNGDEGTTNEKENNSKEINPKNMEYLMNYWREIVGAKLKPQENLYAEGGPLKTDELQGENSKFYLSQDLQDLVNREWGFKRRERDDVKKPGPRVDAKALKILYSNKSVTAQSSNQNQIISDHDHNDYDYDPSYAFVTFHNRFLTDWEKGISFITRLEEMLGLEKNTFTNPRVDPSEVTFKVEKNSKGYDAADVAKQIDVIKEKVRKDTGAQIQSAGVGDRSKYPMIRNSESKENQLFGLDYPVLLALVGSLSVLIVGAVVFAVLLKRDMSARRKMQGLASAAEIDAEATRDYQELCRARMSGKWTGTQTAVAPPTEPPQRITSLSRDPDGNSPSTRSSTSSWSEEPALTNMDISTGHMVLAYMEDHLRNKDRLEQEWQALCAYEAEPCATAAALKPENNGKNRCADVLPYDHSRVILNTLSNHLGSDYINASTITDHDPRNPAYIAAAGPLVQTAPDFWQMVWEQGSVVMVMLTRLTENGQQLCHRYWPEEGSELYHIYEVHLVSEHIWCDDYLVRSFYLKNQRTGETRTVTQFHFLSWPENGVPASTKALLEFRRKVNKSYRGRSCPIVVHCSNGAGRTGTYCLIDMVLNRMAKGAKEIDIAATLEHIRDQRTRTVATKQQFEFVLMAVAEEVHAILKALPAHLQQLQEKKDKEKEKEKGSEKEGTDKDKPN-