Monarch geneset OGS2.0

DPOGS200454
Transcript: DPOGS200454-TA, 1344 bp
Protein: DPOGS200454-PA, 447 aa
Genomic position: DPSCF300260 - 142630-150902
RNAseq coverage: 281x (Rank: top 39%)
Annotation
Heliconius: HMEL013070 | 9e-123 | 80.43%
Bombyx: BGIBMGA011186-TA | 0.0 | 83.74%
Drosophila: Snx6-PB | 2e-153 | 63.61%
EBI UniRef50: UniRef50_Q9VLQ9 | 3e-153 | 64.55% | LD22082p n=24 Tax=Bilateria RepID=Q9VLQ9_DROME
NCBI RefSeq: XP_001657203.1 | 2e-163 | 61.62% | sorting nexin [Aedes aegypti]
NCBI nr blastp: gi|157138030 | 3e-162 | 61.62% | sorting nexin [Aedes aegypti]
NCBI nr blastx: gi|347972001 | 6e-155 | 63.27% | AGAP004487-PA [Anopheles gambiae str. PEST]
Group:
Gene Ontology:
GO:0005515 | 7.5e-15 | protein binding
GO:0007154 | 7.5e-15 | cell communication
GO:0035091 | 7.5e-15 | phosphatidylinositol binding
KEGG pathway:
InterPro domain:
[98-209] IPR001683 | 7.5e-15 | Phox homologous domain
[235-385] IPR015404 | 2.6e-14 | Vps5 C-terminal
Orthology group: MCL14014 | Single-copy universal gene

Nucleotide sequence:

>DPOGS200454-TA
ATGATGGACTGCATTGACGACAACACCAACGATCCCCTCTCAGCCCAAGTGTCCTCGCCGTCTGGAGAAACTAAAGTTGATAAGAAAAAACCCAACGAGAATGTCTCGTTAGCGGATAACAGTTTACTGGTGGATATATCAGACGCTTTGAGCGAGAAAGAGAAAGTAAAGTTTACGGTACACACCAAGACAACGTTGCCTGAATTTCAGAAGTCTGAGTTTTTTGTCGTCCGACAGCACGAGGAATTTGTGTGGCTTCATGACAGATATGAAGAGAATGAAGAATACGCTGGTTATATTAAGTCTGAGTTTTTTGTCGTCCGACAGCACGAGGAATTTGTGTGGCTTCATGACAGATATGAAGAGAATGAAGAATACGCTGGTTATATTATTCCTCCTGCCCCTCCCCGTCCCGACTTCGACGCATCCAGGGAGAAGTTACAACGGCTCGGGGAAGGAGAAGGAGCCCTAACACGAGAGGAATTCCTCAAGATGAAGGAGGAACTCGAGGAGGAATACTTGGCCACTTTCAAGAAGACCGTGGCTATGCACGAGGTGTTCCTGCAGCGGTTGGCATCACACCCTGTATTCAGGGGGGATGCGCATTTGAGAGTATTCCTCGAGTACGAACAGGATCTATGTGCGAAGCCGCGGGGCAGAATGGACCTTATTGGGGGTCTGATGAGGTCAATGACCACTACCACAGATGAGATTTATCTCGGTGCTACAGTCAGGGACGTTAACGACTTCTTCGAACAGGAGACAGCGTTTCTCCAAGAATATTATTCTCATCTCAAAGAGGCTGTAGCGAAAGTCGACCGTATGACCAGCAAGCATAAGGAGGTGGCAGACGCTCACATCAAGCTCTCGTCCTGCGTCACCCAGCTGGCCACGCGTGAGGCTCAACACACTGAGCGGTTCCTCAGCAGGGCTGCCGACACCTTCGACAAGTGCAGGAAAATCGAGGGTCGGATGGCATCAGATCAGGACCTGAAGCTGGCGGACACGTTACGTTACTACATGAGGGACACACACGCTGCCAAGGCGGTGCTGGTGAGGAGATTACGGTGTCTGGCAGCATACGAAGCAGCCAATAGAAACTTAGAGAGGGCGAGAGCTAAAAATAAGGACGTGCACGCCGCGGAACAGGCCCAAGCGGACGCCTGCGCTCGTTTCGAACAGCTATCAGCTCGCGCCAGGGAGGAGCTGATAGACTTCAGGACACGAAGAGTTGCGGCTTTCAAGAAAAGTTTAATAGATCTGGCGGAGCTGGAGATCAAGCACGCTCGGGCGCAGCAGGAGTTGTTCAGGAAGTCGTTACAAGTATTGAGGGAATGCCAGTAA

Protein sequence:

>DPOGS200454-PA
MMDCIDDNTNDPLSAQVSSPSGETKVDKKKPNENVSLADNSLLVDISDALSEKEKVKFTVHTKTTLPEFQKSEFFVVRQHEEFVWLHDRYEENEEYAGYIKSEFFVVRQHEEFVWLHDRYEENEEYAGYIIPPAPPRPDFDASREKLQRLGEGEGALTREEFLKMKEELEEEYLATFKKTVAMHEVFLQRLASHPVFRGDAHLRVFLEYEQDLCAKPRGRMDLIGGLMRSMTTTTDEIYLGATVRDVNDFFEQETAFLQEYYSHLKEAVAKVDRMTSKHKEVADAHIKLSSCVTQLATREAQHTERFLSRAADTFDKCRKIEGRMASDQDLKLADTLRYYMRDTHAAKAVLVRRLRCLAAYEAANRNLERARAKNKDVHAAEQAQADACARFEQLSARAREELIDFRTRRVAAFKKSLIDLAELEIKHARAQQELFRKSLQVLRECQ-
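
The transcript and protein lengths listed above are consistent with each other: 447 codons plus one stop codon gives 3 x 448 = 1344 bp. As a minimal sketch of how the two sequences relate, the Python snippet below translates short excerpts taken from the start and end of the DPOGS200454-TA sequence. It assumes the standard genetic code applies to this record; `translate` is a hypothetical helper written for illustration and is not part of any database tooling. The stop codon is written as "-" to match the notation used in the protein sequence above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper).

    Stop codons are emitted as stop_symbol, matching the record's notation.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# Excerpts copied from the first and last codons of DPOGS200454-TA.
first_codons = "ATGATGGACTGCATTGAC"
last_codons = "GAATGCCAGTAA"

print(translate(first_codons))  # MMDCID
print(translate(last_codons))   # ECQ-

# Length check from the record: 447 aa plus one stop codon.
print(3 * (447 + 1))  # 1344
```

The same function can be applied to the full nucleotide sequence to compare its translation against the listed protein sequence.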