Monarch geneset OGS2.0

DPOGS211144
TranscriptDPOGS211144-TA3135 bp
ProteinDPOGS211144-PA1044 aa
Genomic positionDPSCF300007 - 145119-150094
RNAseq coverage8x (Rank: top 86%)
Annotation
HeliconiusHMEL0026153e-5832.91% 
BombyxBGIBMGA003016-TA1e-17666.82% 
DrosophilaCG16854-PB1e-11035.12% 
EBI UniRef50UniRef50_D6W8Y92e-14432.94%Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6W8Y9_TRICA
NCBI RefSeqXP_973695.16e-12541.43%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|2700019667e-14432.94%hypothetical protein TcasGA2_TC000881 [Tribolium castaneum]
NCBI nr blastxgi|2700019662e-14432.99%hypothetical protein TcasGA2_TC000881 [Tribolium castaneum]
Group
Gene OntologyGO:00081525.4e-12metabolic process
GO:00038245.4e-12catalytic activity
KEGG pathway 
InterPro domain[444-1043] IPR0042456e-136Protein of unknown function DUF229
[1-352] IPR0178505.4e-12Alkaline-phosphatase-like, core domain
[19-277] IPR0178497.6e-10Alkaline phosphatase-like, alpha/beta/alpha
Orthology groupMCL15408 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211144-TA
ATGGGCATTGATGCGCTTTCGAGATTGAATTTTCATAGAACAATGCCGAAAACTTTGGCATATTTGAAAAAGAAGGGTGCTATAGAATTTTATGGTTATAACAAAGTCGGTGACAATACTTTCCCTAACCTGTCACCTATTCTTCTCGGAATGAAAGACACAGATTTAAAGAAGGCTTGCTGGCCGCATGTAAGAGCTACATTCGATAATTGTCCTTTTATTTGGGATTTATTTAAAAGCGACGGTTACTACACAGCCTTCGGTGAGGACACTTCCAGTTTAGGAACATTTAACTTTGAAAAAGTCGGTTTTAGTCGAACTCCAACCGATTACTACTTACATACGTTCATGCATGAAGCTGAGCTGTATACTGGTAACAACAGAGACTTCAACTCCTACATATGTATGGGAAATAAATACTTTTACAAGGTTTTATTAGATTACATAGAAAATTTGACAATGTCTTTACGTACTTCCAAACTCTTTGGATTCTTTTGGGAAGTCACATTGAGTCATGATTATTTAAACTATCCTATGGCGATGGACGAGAACTATGAAGTATTTTTGAAGAATTTAGACGACGCACGCTATTTGGACGACACTATTTTAATAATCCTAAGTGATCATGGAATACGTTGGGGTGATATACGATATACGAAACAGGGGAGGCTGGAGGAGCGACTTCCACTTTTGCATGTTTTACTACCCGAGTCGTTTAGAGTGAACTATTCTTTAGCTTACGATAATATCAAACTGAACAGCAATCGTCTTACGACGCCGTTCGATCTGTATGCAACTTTAATAGATCTTTTACATATGGATGGGATAAGTAACGACAACTTGAAATTAAGAAGCGAAACGCCGTACGGTAACGAGAGAGCTATCAGTTTATTTATGCCTATTCTTAGCAACCGTACTTGCACTACAGCGGGTATAGACGACCACTGGTGCACGTGTCGCAGAGGACGGAAGATACCAATCTTCAGCGCGGAAGCCTACGACGCTGCTGATAATTTATTGACTTTGATAAACCAACTACTAAAGGGATATTATCAATGCGCCCATCTGATGCTTGAAGAACTGATAGAGGTGACAGAAATCATATCAGGCACTCCGTATGAGAAAGAAGTCGGTTGGCGAGAATTCTTAGTCGTCATACGGACATCGCCTGGCGGGGCCGTGTTTGAGGCAACTCTTCGTCAGGATGGCCAGACCTGGTCTCTCGCCGGTACCGTCAGCCGACTTAACCTGTACGGACAGCAGGGCCACTTATGGCTTGGACTGTGGGAGGTGCTGGTTGCAGCAGCATTACAAGAACAAATCAAAGGAATAAAGGACTCCAACGAGATCTTAAGCGATGAAATGTATACAATCAGAACAAATGGATGCATCATATCTGCTTTACAGCCTTTAGGCAGTGAAGTGAGACAGTTAGTGAAATTCCCAAAAGATTTAAAACCGTGTCCAATGTCAGCTGTGGCATTGTTGTCTAACAACAGAACGCACATATGGATAAAACATGAAAACAGGCAATATTATAATATAAGCGATGCTGCCTTTTTGAAATGTTGCTATAAATCATTTTACCGGCCGCTCTCAGTCGACGACATTACATCCCGAGATGTCGACAAACGAGTTAAATATAACGTTTGTTTTAATTTTACTACCTCAATTATAGCAGCTCATGAATTTGTTCGAGTAAAATGTTATGCTGGATCAGTGGAAATTTACGACCAGTTTTTCATTTTTGCTCCCAAAAAAGAACTATCATCTGAAGTACCAGAGATTCCCAAAAACAAAACCGCATATAATGTTCTTATAATGGGAATAGATGGTGTATCCAGACTTAATTTTCATAGAACAATGCCAAATACTTTTGCATTCTTAGAAAAAAAAGGTGGTGTGGAGTTATTAGGATACAATAAAGTCGGTGACAATTCTTTTCCTAATTTGATACCCATGCTGATGGGGTTATCAGAACAAGATCTAAAGTCCACGTGTACTCCTCGTAAAAAATCAACATTTGATAATTGCCCCTTCATTTGGGAATGGTTCAAAGAAGCTGGTTACTATACAGCGCTGGGAGAAGACAGTGCCAGTCTAGGAACCTTCAATTATGGAAAATTTGGGTTTATTGGATCTCCCACGGACTACTACATACATACCTTTATAAATGAGGCTGAAGAAAACGTAGGAGTTAATAAAGATTTTAATTCATTTTTATGTATGAATGATAAGTATTTCTATAAGGTCTTATTAGATTACATAGAAAATCTAGCGACGGCACTGAGGACATCTAAACTATTTGCTTTCTTCTGGGAAGTGACCATGACTCATGATTATTTAAATTATCCAATGATAATGGATGAAGATTACGTAAAATTTCTAACACGATTAGATTCCGTAAATTATTGGAATGAGACAATACTTATTTTCATTAGCGACCATGGAATTCGTTATGGCCAAATAAGGTCAACGAATCAGGGACGGTTAGAAGAACGCTTACCTTTCGCATACATATTGTTGCCACCGGACTTTAAGGAGAAATATAAAGAAGCTTACAGAAACCTTCAATTAAACAGTAAACGCCTTAGTACTCCTTACGATATTCACGCCATGTTATCCGATCTTGTTAATTTAGATAATATCGAAAGCGATAAAATTGTTTTACGCTCAAATACGAATGAAGAACATGTTAAAGGTAACAGTTTATTTATACCTATACCTTTGAGCCGCACTTGTTCGTCCGCTCACATCAGTGATCACTGGTGTAGCTGTCAGAAGAGTCATAAAATATCAACTAAGAGTAAACTTGGTAAAGAAGTAGCTAAACAATTAGTACAACAGCTCAATGACCTCGTTAGCGACTATGAACAGTGCGCTAAGCTAAAACTAGCTGAACTAATCGAAGTTATTGAAATGGAACCGAATTCCTTCGGGTACGATTCAGTTACCTGGCGAGAGTTCATGGTTATGGTGAAAACTATACCAGGCAACGGTTTGTTCGAAGGAACTCTTCGTGTTGTTGATAACGAATGGTTAGTGGCTGGATCTATAAGTCGATTTAATCTATACGGCAATCAGAGTCACTGTATTCAGGACACCTCACTGAAGTTGTACTGTTACTGCCAATAG

Protein sequence:

>DPOGS211144-PA
MGIDALSRLNFHRTMPKTLAYLKKKGAIEFYGYNKVGDNTFPNLSPILLGMKDTDLKKACWPHVRATFDNCPFIWDLFKSDGYYTAFGEDTSSLGTFNFEKVGFSRTPTDYYLHTFMHEAELYTGNNRDFNSYICMGNKYFYKVLLDYIENLTMSLRTSKLFGFFWEVTLSHDYLNYPMAMDENYEVFLKNLDDARYLDDTILIILSDHGIRWGDIRYTKQGRLEERLPLLHVLLPESFRVNYSLAYDNIKLNSNRLTTPFDLYATLIDLLHMDGISNDNLKLRSETPYGNERAISLFMPILSNRTCTTAGIDDHWCTCRRGRKIPIFSAEAYDAADNLLTLINQLLKGYYQCAHLMLEELIEVTEIISGTPYEKEVGWREFLVVIRTSPGGAVFEATLRQDGQTWSLAGTVSRLNLYGQQGHLWLGLWEVLVAAALQEQIKGIKDSNEILSDEMYTIRTNGCIISALQPLGSEVRQLVKFPKDLKPCPMSAVALLSNNRTHIWIKHENRQYYNISDAAFLKCCYKSFYRPLSVDDITSRDVDKRVKYNVCFNFTTSIIAAHEFVRVKCYAGSVEIYDQFFIFAPKKELSSEVPEIPKNKTAYNVLIMGIDGVSRLNFHRTMPNTFAFLEKKGGVELLGYNKVGDNSFPNLIPMLMGLSEQDLKSTCTPRKKSTFDNCPFIWEWFKEAGYYTALGEDSASLGTFNYGKFGFIGSPTDYYIHTFINEAEENVGVNKDFNSFLCMNDKYFYKVLLDYIENLATALRTSKLFAFFWEVTMTHDYLNYPMIMDEDYVKFLTRLDSVNYWNETILIFISDHGIRYGQIRSTNQGRLEERLPFAYILLPPDFKEKYKEAYRNLQLNSKRLSTPYDIHAMLSDLVNLDNIESDKIVLRSNTNEEHVKGNSLFIPIPLSRTCSSAHISDHWCSCQKSHKISTKSKLGKEVAKQLVQQLNDLVSDYEQCAKLKLAELIEVIEMEPNSFGYDSVTWREFMVMVKTIPGNGLFEGTLRVVDNEWLVAGSISRFNLYGNQSHCIQDTSLKLYCYCQ-