Monarch geneset OGS2.0

DPOGS212992
Transcript         DPOGS212992-TA   3570 bp
Protein            DPOGS212992-PA   1189 aa
Genomic position   DPSCF300024 - 839231-854548
RNAseq coverage    246x (Rank: top 42%)
Annotation
Heliconius       HMEL007701         0.0      79.18%
Bombyx           BGIBMGA006906-TA   9e-179   76.83%
Drosophila       rogdi-PB           2e-90    64.20%
EBI UniRef50     UniRef50_D6WN32    7e-129   35.01%   Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WN32_TRICA
NCBI RefSeq      XP_001812575.1     1e-151   36.56%   PREDICTED: similar to sorting nexin 13 [Tribolium castaneum]
NCBI nr blastp   gi|189237751       3e-150   36.56%   PREDICTED: similar to sorting nexin 13 [Tribolium castaneum]
NCBI nr blastx   gi|242020972       1e-129   34.07%   Sorting nexin-13, putative [Pediculus humanus corporis]
Group
Gene Ontology     GO:0005515   5.6e-25   protein binding
                  GO:0007154   5.6e-25   cell communication
                  GO:0035091   5.6e-25   phosphatidylinositol binding
KEGG pathway 
InterPro domain   [315-548]     IPR003114   1.3e-32   Phox-associated domain
                  [825-930]     IPR001683   5.6e-25   Phox homologous domain
                  [1039-1143]   IPR013937   1.6e-15   Sorting nexin, C-terminal
Orthology group   MCL15839 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212992-TA
ATGAATGACAACGAAAAAGAGGAAGCTGCAAATTTGAAAGACGAGTTTGAATGGGTTCTTCGTGAGGAAGTTCACGCTATATTACATCAACTACACTCAGTTCTAGTGGAATGTGCACATCGTTTTCCAGTCCCACTATATGGGAATGAAGGACAAAAACAAGACAAATTTATATTAACATCTCAACCAGAACAACTGAAGTGTGTTGTCACCCTTACAGGAGATAGTATCACGCATGCAGATATAAGTTTTAAAGTATTAAGGCAGATGCACACAATATGCAAGACCTCAATTAACCAGGATGGGCCTTGGAAACTGCAACAGATACAGGATGCTGCTAATCATTTACAACAAGCCATTGGCTATATAGATAATGTTGATAAACATTATGTATTCAGGTCATCTGAAGAAGTGCTTCATATAATACAGTGCTTGCTAGGATCCTTACAAAGAGCTAGAACAGCTCTTGTGTTACCTAAGAAAAAGGCTATCGATGAGTTAATGAAGAGTAGAAATATGAAAGCATTATCTCCAAATTTGCCAGAAGATCTTGCAATATCTTTCTACATACAGTCCCATAAATTGATATTTGTGGCGTATCAGCTTAGTTTGGTTCATGGAAGTATGCGCATTGACTCATGTCAAGCCGACTGTGCCGTACCATGGCTAAGCGATGTCCTGTTCATGTTAACGGCCGCTTTGCATATGTGTCAACAACTTAAAGATAAGTATCCTGATAACCTATTGGCTCTCTCATTAGATCCAAGGGTCTTGACTATCTTATATTTGAACTACGATAGCAAGAAAATATTCCCATTAGATATTGATTATTTGGATGATCCCCTGCAACAATCAAATTTCTCTGTTCAATCAGCTAAAATTTTAGAATTATTTAAATCTAAAAAATCACTGCCTAAATTTGACAGTAGAATAACTGGTAGTGAGACAGTGGATTCATTCCTGAATGAAATTGTTTCAATAATTATCAATGACTATGTCACAACTTGGTACGAACTTATAACTGATGATCAAGAGCTTACTACATATGCAATAAAGAAATTAGTTGTTGCAGCAGGCGCAAATGTATCAAACAGGGTAAAAACTGTAGACTGGATACATCTGTTAGCAACTAGGTTCCCGGAAGAATTAACGTTGCACTTAAAGTTATTTAAACAGTCTAGGGTAAGGTTAAAACGGATGCAGTTGCTGAGTGCCAAAGAAATGAATGGCAATTCAAAGCCAGTTCCTAAGAGGCCAGAGGAAAAACGGACTCATAGGAGAAATAAGAGCGAAACCGATTTACTTTGGCCTCCAGATTCCCAATCTTTTGGTAAATCAAAGTTTTACAGCAGTTCAGAAAATATAAGCAGTAACAATATAAAGGACCTGTTCTTTGATTTGGAGTGTTCAATAGAGAATAAAGAGTTATGTAGGGATATATTTTGCACTGATCCAGAGAAGGAAGCCGCCCTCCTGTCCGAGGTGTCGGAAGCTCTGCTATATCTGTTTGTGCCAGAAGAAGCGTGGAACTGTCATGCAATGAAGTTAATACTTATCGATCTGCTGTCGTCGATAGTTCTCAGGCCATTGATAAAAATGTTGAGCGACCCTGACAATATAAATAGAGCTATAATAAGATCGTGTTGTCGTGACTCGTGTCTATCATCTGATTTGTTCCTAATGGTCATAAGAACCTGTGGCGATGCCGAGGAATTAGACGCGACGCTGGAGCTCGTGCAGAAGGATATACAGAAATTACATTCAAAAGACAGTGCTGGTGAATGGGAACTACAAGATCGACAAAAACTGTCTTCTTTGCAATATTTATCAAGGATCATACTGGCCAGTCGAGCGACGCTGGGACCACAAGAAGGTTTATCCACAACAGATAATACGGAGAGAGAATCGGAAGAAGTGATGAAAACTTTAAAGTTTCTTAGAGCGGTGGCCGCTTGGAAATCTAACGCACAATATCTATTAGAAATTGAATTAAATGAAGC
AGATTCACACACAACTAAAGCTGTGATGGACAATCTGCGTTCATCAGCTCTGGAGGTGTGTGACTTGTACCTGAGGGGTGTGGGGGTGCTGGGGGTGCCTGACAACGCCCACGCTGACCTAGTCAGGAGGATCACCACCGACGGCGGGGAGTTCACCAGCAATCCCATACAGTGCTTTGATGATGTACAGAAATGTGTTTGTGACGCGCTTGAGGAGGATCCCTCGTGGCAAGCTGATTTTATGTTCGATGGCGACCAGGATGGTATGGATAATAGTGAAAACAAAAAAGATTTGAAATCGGAGTCCCATAAATATACTGGCGACGTTGTGCTACCCGTTCCTGGCGCGAGGCATAACAGAAGTCGTTCAGATATCGTTGGCAGTTTCGCTCAGAAAATGATGAATGAGAGTGCAGGTCCATCTAATTTAAAATCTGCGACCACAAGCACCTTGTCGTTGTCACATTCATCTATAAGTCAAAGCGTTAAAAACATGATGTCGCCGTTAACTGCGTATATTATAGAAACAGCATTGGTACAAGACAAAGGTAAAACTTTCGGTATATACGCTATAGCTGTGACGAGAGAATCTGATAACGAGGTCTGGCACATATATAGACGATATAGTGATTTCTATGATCTACACGCATCAATCAAAGAGAAGTGGCCAGAGCTAGGTCACCTTCCGTTTCCGGCTAAGAAGACATTTCAGAATACATCTAGATCAGTTCTGGAGAGTCGTAAACGGATGCTCAACAGTTACTTACAAAGTTTAACGAGTATTTCGAGGGATTCCAGGTACATGGCGCTACTGTCGCCTGACTATCTTGGAGGATTTCTCAGTCCGGAAAATCAAACTGAAAGGCATGGGAATACGATTGACGCACTTCTAGTCAATTCACTGAAAGCTGGTATGAGGACTTTAAAAAGTATGCCCGATCAGTTCGCAAATACAGTCGACGGAGTCATGGACGGTATATCAAAAGTATTCCAAGGTAAAAGTGGTGAAAATCTAAAGAATTTCAAAACTTGGAATTCATCGGACGTTCAGGACGATAACGACGAGAGTGTACCATTGAGGCTGTTAGAGGAGGTTCTCGGCATCAGGGGTTTGTGGCTGAGAAGAAGATTACTGGCTCCGCTACGGACGATGATCGCTGATAGAGTTAACAAAAAAGTTATAGAGTTCGTCTCATCTTTAACGTCACCTCGGAACGTGGTTCAATATTTGAAAACTTTTAAACAGTGGCTTACGAGTAGGAACAATCCGAGCGCCGTCTCCAGGGATCAGGCCACAAAGGCAAGGACGAGGGTAGCTGCTAAAGTTGCCTTACTATCAGCTGCTTCAGACGATCTCCGTCACATAGTTGGTACGGACGCTGCCAGGAGAGGACTCCTCACAGTGTTCGATCTGTTTCAGACTCAAGAGATCAACAAACGACTCTTGTTCGTTCTACTTGAAGTAACACTGACAAACCTGTTTCCGGACAACAACATCCGGGATATGTTCAAGACGCTGTACTCAAACTCTCCTCGAGTTCCGTCTACCAAGAAATCAGTTAATGTATAA

Protein sequence:

>DPOGS212992-PA
MNDNEKEEAANLKDEFEWVLREEVHAILHQLHSVLVECAHRFPVPLYGNEGQKQDKFILTSQPEQLKCVVTLTGDSITHADISFKVLRQMHTICKTSINQDGPWKLQQIQDAANHLQQAIGYIDNVDKHYVFRSSEEVLHIIQCLLGSLQRARTALVLPKKKAIDELMKSRNMKALSPNLPEDLAISFYIQSHKLIFVAYQLSLVHGSMRIDSCQADCAVPWLSDVLFMLTAALHMCQQLKDKYPDNLLALSLDPRVLTILYLNYDSKKIFPLDIDYLDDPLQQSNFSVQSAKILELFKSKKSLPKFDSRITGSETVDSFLNEIVSIIINDYVTTWYELITDDQELTTYAIKKLVVAAGANVSNRVKTVDWIHLLATRFPEELTLHLKLFKQSRVRLKRMQLLSAKEMNGNSKPVPKRPEEKRTHRRNKSETDLLWPPDSQSFGKSKFYSSSENISSNNIKDLFFDLECSIENKELCRDIFCTDPEKEAALLSEVSEALLYLFVPEEAWNCHAMKLILIDLLSSIVLRPLIKMLSDPDNINRAIIRSCCRDSCLSSDLFLMVIRTCGDAEELDATLELVQKDIQKLHSKDSAGEWELQDRQKLSSLQYLSRIILASRATLGPQEGLSTTDNTERESEEVMKTLKFLRAVAAWKSNAQYLLEIELNEADSHTTKAVMDNLRSSALEVCDLYLRGVGVLGVPDNAHADLVRRITTDGGEFTSNPIQCFDDVQKCVCDALEEDPSWQADFMFDGDQDGMDNSENKKDLKSESHKYTGDVVLPVPGARHNRSRSDIVGSFAQKMMNESAGPSNLKSATTSTLSLSHSSISQSVKNMMSPLTAYIIETALVQDKGKTFGIYAIAVTRESDNEVWHIYRRYSDFYDLHASIKEKWPELGHLPFPAKKTFQNTSRSVLESRKRMLNSYLQSLTSISRDSRYMALLSPDYLGGFLSPENQTERHGNTIDALLVNSLKAGMRTLKSMPDQFANTVDGVMDGISKVFQGKSGENLKNFKTWNSSDVQDDNDESVPLRLLEEVLGIRGLWLRRRLLAPLRTMIADRVNKKVIEFVSSLTSPRNVVQYLKTFKQWLTSRNNPSAVSRDQATKARTRVAAKVALLSAASDDLRHIVGTDAARRGLLTVFDLFQTQEINKRLLFVLLEVTLTNLFPDNNIRDMFKTLYSNSPRVPSTKKSVNV-