Monarch geneset OGS2.0

DPOGS208501
TranscriptDPOGS208501-TA2052 bp
ProteinDPOGS208501-PA683 aa
Genomic positionDPSCF300064 - 790005-792056
RNAseq coverage139x (Rank: top 55%)
Annotation
HeliconiusHMEL0087480.088.35% 
BombyxBGIBMGA010603-TA0.083.36% 
Drosophilasynj-PB0.061.83% 
EBI UniRef50UniRef50_Q5U0V70.061.83%GH06496p n=19 Tax=Neoptera RepID=Q5U0V7_DROME
NCBI RefSeqXP_001843863.10.063.76%synaptojanin [Culex quinquefasciatus]
NCBI nr blastpgi|3320231750.059.24%Synaptojanin-1 [Acromyrmex echinatior]
NCBI nr blastxgi|3227890410.056.12%hypothetical protein SINV_10499 [Solenopsis invicta]
Group
Gene OntologyGO:00044377.8e-118inositol or phosphatidylinositol phosphatase activity
GO:00037232e-35RNA binding
GO:00163112e-35dephosphorylation
GO:00044392e-35phosphatidylinositol-4,5-bisphosphate 5-phosphatase activity
GO:00001665.7e-25nucleotide binding
KEGG pathwaycqu:CpipJ_CPIJ0021940.0 
 K01099 (E3.1.3.36)maps-> Phosphatidylinositol signaling system
    Inositol phosphate metabolism
InterPro domain[123-449] IPR0003007.8e-118Inositol polyphosphate-related phosphatase
[103-469] IPR0051352.1e-77Endonuclease/exonuclease/phosphatase
[442-568] IPR0150472e-35Domain of unknown function DUF1866
[461-549] IPR0126775.7e-25Nucleotide-binding, alpha-beta plait
Orthology groupMCL11409 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208501-TA
ATGCTGGCTATTCAGCTAATGTTATTACAGTGTATTGACAAAAAGCAGAATGTAAATAGATTTGAAGAAGTTTTTAAGCAGATGTGGGTAAATAATGGTAATGAAGTGTCTAAAATATATGCTGGGACTGGTGCAATTCAAGGTGGCTCAAAACTAATAGATGGAGCGAGGTCAGCTGCTCGCACCATACAAAACAACTTACTTGACAACTCTAAACAAGAAGCTATCGATATATTGCTTTTAGGATCCACACTGAATTCAGAGCTTGCGGATCGAACAAGAATACTACTACCATCTAATATTCTACATACCTCAACACCAGTATTAAGAGAAATGTGCAGGAGATCCAGTGAATTTACACAGCCTTCCAGTATCCGTATAACTATTGGTACATATAATGTAAATGGTGGCAAACATTTTAGAAGTTTAGCTTATAAAGACGTATCACTCGCCGACTGGTTGTTAGATTCACCAAATTCATCATTAATTAACACTGTTAATTTGAAAGATCATCCATCAGACATTTTTGCTATTGGCTTCCAAGAGATTGTCGACCTTAATGCTAGTAATATCATGGCAGCAAGCTCAGAACACGCAAGAGCGTGGAGTGAAGAACTTGAGAAAATATTATGTAGAGATGCCTCTTACACTCTTCTGTCTAGTCATCAACTGGTTGGAGTTTGTCTATTTGTTTTTGCTAGAAAAGATTTAATTCCACATATAAGAGATGTTGCTCTCGATTCTGTTAAAACTGGCCTCGGCGGAGCTACAGGAAACAAAGGCGCAGTGGCAATTCGTTTGGTTATATACGGAACCTCTTTGTGTTTTGTTTGTGCTCATTTCGCTGCAGGTCAATCGCAAGTTACGGAACGAAATGCCGATTACACTGAAATCACTAGGAAAATTGCGTTTCCTATGGGAAGATCACTTTATTCTCATGATTACGTGTTTTGGTGTGGTGATTTTAATTATCGAATTGATTTAGATAAAGAAGAGACTCGATTATTGGCATCTCAGAATAATATACAAAGACTTCTAGATCAAGACCAATTATTACGCCAGAAATCACAGGATCTAGTCTTCAAAAACTGTTTTGAAGGAGAAATTACTTTCTTACCAACTTATAAATATGATTTATTTAGCGACGATTACGATACGAGTGAAAAATGTAGAGCCCCGGCCTGGACTGATAGAGTCCTTTGGAGAAGTAGAAAACATACAATTGATCCTGAAAATACTTCTGAACTGGCTGCTGGAATATTACTGCATTATGGCAGGGCTGAATTAAAGCAAAGCGACCATCGTCCGGTTATTGCAATTTTAGAGATTGAAGTTCTACAAGTTAGTTACACTAGATGTATGGAAGTTTTTTATGAAGTTGTTAAGGACCTTGGCCCTCCCGACGCCAGTATTATTGTAAAACCAATGGAAGAAATAGACAGCGAAGATTCCATTTTCGATGATAACATAATGGCAGGCGTTTTACAAGAATTGTCAACCATTGGTGAAGTGACATTAGTACGTTTTGTCGACGACCATACTTTGAGTATAACTTTTAGGGATGGCCAGCATGCATTAGCTGCGTCACAGAAAGGATACATTAACATTGGTGACATTCAGTTGACCATGTCCCTAAAGTCTCCTAATTGGCTGGATATTGTAAAGGCCGAGGTTAATTTGTGTTCCAATAACACGATTCCACTATTTGGCGAAGTTCCCGCCGATCGTCCCTTGCGGTCAGCTCCGCCGACTCCTCGGAGACAGCAGCCGAGTCGACCGCCGGCTCCATCGCCCACGCCTCTTGTACCTTCGAGAGCACCAACTGGACCAGCGCGCCCGCCACCTCCTAGGCCCGCTCCTATTCCTCAAGACGTAGGGATAGTCTCTCAGCAAAGTTTATCGTCACCTCCAGCCGATAGCCTGCCGCCACCAACATGTGCTCCTCCCCCTCCTCCTTCTAATACCCCACCCCCTACGGGACATCCGCCTCCAGTACCAGCGCGACAAGGTGCGCCACCACCTCTACCAGCGCGCCCAAGGCCAGTTATGTAG

Protein sequence:

>DPOGS208501-PA
MLAIQLMLLQCIDKKQNVNRFEEVFKQMWVNNGNEVSKIYAGTGAIQGGSKLIDGARSAARTIQNNLLDNSKQEAIDILLLGSTLNSELADRTRILLPSNILHTSTPVLREMCRRSSEFTQPSSIRITIGTYNVNGGKHFRSLAYKDVSLADWLLDSPNSSLINTVNLKDHPSDIFAIGFQEIVDLNASNIMAASSEHARAWSEELEKILCRDASYTLLSSHQLVGVCLFVFARKDLIPHIRDVALDSVKTGLGGATGNKGAVAIRLVIYGTSLCFVCAHFAAGQSQVTERNADYTEITRKIAFPMGRSLYSHDYVFWCGDFNYRIDLDKEETRLLASQNNIQRLLDQDQLLRQKSQDLVFKNCFEGEITFLPTYKYDLFSDDYDTSEKCRAPAWTDRVLWRSRKHTIDPENTSELAAGILLHYGRAELKQSDHRPVIAILEIEVLQVSYTRCMEVFYEVVKDLGPPDASIIVKPMEEIDSEDSIFDDNIMAGVLQELSTIGEVTLVRFVDDHTLSITFRDGQHALAASQKGYINIGDIQLTMSLKSPNWLDIVKAEVNLCSNNTIPLFGEVPADRPLRSAPPTPRRQQPSRPPAPSPTPLVPSRAPTGPARPPPPRPAPIPQDVGIVSQQSLSSPPADSLPPPTCAPPPPPSNTPPPTGHPPPVPARQGAPPPLPARPRPVM-