Monarch geneset OGS2.0

DPOGS214383
Transcript: DPOGS214383-TA, 1419 bp
Protein: DPOGS214383-PA, 472 aa
Genomic position: DPSCF300020 + 1055197-1060421
RNAseq coverage: 592x (Rank: top 21%)
Annotation
Heliconius       HMEL004656         0.0      79.08%
Bombyx           BGIBMGA004010-TA   0.0      75.28%
Drosophila       CG2774-PA          2e-120   49.07%
EBI UniRef50     UniRef50_B4KJL4    2e-118   50.12%   GI17151 n=5 Tax=Diptera RepID=B4KJL4_DROMO
NCBI RefSeq      XP_966953.1        7e-141   51.84%   PREDICTED: similar to sorting nexin isoform 1 [Tribolium castaneum]
NCBI nr blastp   gi|91087199        1e-139   51.84%   PREDICTED: similar to sorting nexin isoform 1 [Tribolium castaneum]
NCBI nr blastx   gi|91087199        2e-133   51.75%   PREDICTED: similar to sorting nexin isoform 1 [Tribolium castaneum]
Group
Gene Ontology    GO:0005515   8.8e-29   protein binding
                 GO:0007154   8.8e-29   cell communication
                 GO:0035091   8.8e-29   phosphatidylinositol binding
KEGG pathway 
InterPro domain  [312-438]   IPR015404   2.3e-36   Vps5 C-terminal
                 [74-209]    IPR001683   8.8e-29   Phox homologous domain
Orthology group  MCL13447    Single-copy universal gene

Nucleotide sequence:

>DPOGS214383-TA
ATGTCGGCGGAGACCGATGCACCGTTCAATAATGTGGAAATAAGTAACGAGAACCGGGAAGAGGAAGATCTATTCGCCTCGGCGGTACAGGAGGTCAGTCTAGATCCTGAAATTAATGGTACCCAAGATGGATTAGAAAAGTCAACAATAGATGACATTTCGGTTTCATCACCCGCTACTATTGGCAGTTCTATTATGGAGGAAATAGCAACAGAACGTGCTAACAATATTATAATAACAATAACCGAGCCTCAAAAGATTGGTGAAGGGATGAGCTCATATGTAGCCTACCGTGTCATCACCAAAACAAACATGCCAATCTTTAGCAAATTAGATTTTGCGGTTCTAAGGCGATTCTCCGATTTTCTAGGACTTCATGAGAAATTGACCGAGAAATACTTGCGCTCTGGTAGAATTATACCTCCAGCACCAGAAAAAAGTATCATGGGAACAACAAAGTTGAAGATGTCATCGACTCCGTCTACAGAGAGTGCTAATGGCTCACCGTCGGTTCAATCACAGTTTGTGGAACGAAGACGAGCTGCCCTGGAGAGGTTCCTGAACAGAGTAGCCCAACATCCTGTACTGTGTATTGATCCCGATTTCAGAGAGTTTTTGGAGTCTGACACTGAACTACCAAAGGCCACGAGTACCTCGGCGCTTAGTGGAGCTGGTATGCTGCGACTCTTCAATAAAGTTGGAGAAACAGTCAACAAGATCACATACAGGATGGACGAGTCCGATCCTTGGTTCGAAGAGCGCGTGGCTCGTATAGAGTCTCTGGAAAGCGGTCTACGGCGTCTGTGTGGGGCCTGTGAGGCGCTCGCTACTGAGAGACGTGAACTGGCGGGGCGAGCTCATGAGGCGGCTCGGGCCATCGCCGGATATATATACATATATTTTTTTAATATTAAAATAAACTTTGAAATTGAAGAGAATGAACAAGCCAACACAGACTTCTATGTTCTGACCGAACACATTAAAGATTATCTCGGATTAATTGGTGCTATCAAAGACGTGTTCCATGAAAGAGTTAAGGTATTCCAACACTGGCAACACTCACAAATGCAGCTAACGAAGCGGAGGGAAAACAAAGCGAAAGCGGAACTGGCCAACCGTCCGGAGAAAATCGAACAGGCCGCTAATGAAATTATTGAGTGGGAGTCGAAAGTGGAACGCGGCCAGCAGGAGTTTGATACAATGTCGAGGGTCATCAAGAAGGAACTGGAACGCTTTGAAGAGATCCGCCTCGACCAGCTCAGAGACACGCTGCTGCGGTATCTTGATGAGCATATGAAACACCAGGCACAGCTTAATCTTCATCTACAGGTAACTAAGAGCGATGCTAGCCAGTTCCATTATGCCACCGCTATTCGGTACTGGGACGCTTTCCTTCCTGAGGCCCGCGCCATCAAATGA

Protein sequence:

>DPOGS214383-PA
MSAETDAPFNNVEISNENREEEDLFASAVQEVSLDPEINGTQDGLEKSTIDDISVSSPATIGSSIMEEIATERANNIIITITEPQKIGEGMSSYVAYRVITKTNMPIFSKLDFAVLRRFSDFLGLHEKLTEKYLRSGRIIPPAPEKSIMGTTKLKMSSTPSTESANGSPSVQSQFVERRRAALERFLNRVAQHPVLCIDPDFREFLESDTELPKATSTSALSGAGMLRLFNKVGETVNKITYRMDESDPWFEERVARIESLESGLRRLCGACEALATERRELAGRAHEAARAIAGYIYIYFFNIKINFEIEENEQANTDFYVLTEHIKDYLGLIGAIKDVFHERVKVFQHWQHSQMQLTKRRENKAKAELANRPEKIEQAANEIIEWESKVERGQQEFDTMSRVIKKELERFEEIRLDQLRDTLLRYLDEHMKHQAQLNLHLQVTKSDASQFHYATAIRYWDAFLPEARAIK-
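
The stated lengths of the two records can be cross-checked against each other. The following is a minimal sketch, assuming that the trailing `-` in the protein record marks the stop codon and that the transcript record holds only the coding sequence, stop codon included (it begins with ATG and ends with TGA). The function names are hypothetical and are not part of the geneset or its tooling.

```python
def expected_cds_length(protein: str) -> int:
    """Return the coding-sequence length in bp implied by a protein sequence.

    Assumption: a trailing '-' marks the stop codon, and the coding
    sequence includes that stop codon.
    """
    residues = protein.rstrip("-")
    return 3 * (len(residues) + 1)  # one extra codon for the stop


def is_consistent(cds: str, protein: str) -> bool:
    """Check that a nucleotide sequence has the length the protein implies."""
    return len(cds) == expected_cds_length(protein)


# Lengths stated in the record: 472 aa protein, 1419 bp transcript.
print(expected_cds_length("M" * 472 + "-") == 1419)
```

Passing the full DPOGS214383-TA and DPOGS214383-PA sequences to `is_consistent` would apply the same check to the sequences themselves instead of the stated lengths.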