Monarch geneset OGS2.0

DPOGS215767
TranscriptDPOGS215767-TA1425 bp
ProteinDPOGS215767-PA474 aa
Genomic positionDPSCF300041 + 1555573-1560384
RNAseq coverage387x (Rank: top 31%)
Annotation
HeliconiusHMEL0141090.071.79% 
BombyxBGIBMGA003644-TA0.074.41% 
DrosophilaCG5734-PA2e-6135.63% 
EBI UniRef50UniRef50_Q7Q3532e-10443.84%AGAP011491-PA n=5 Tax=Arthropoda RepID=Q7Q353_ANOGA
NCBI RefSeqXP_969378.21e-10744.89%PREDICTED: similar to sorting nexin [Tribolium castaneum]
NCBI nr blastpgi|1892336932e-10644.89%PREDICTED: similar to sorting nexin [Tribolium castaneum]
NCBI nr blastxgi|1892336932e-11145.72%PREDICTED: similar to sorting nexin [Tribolium castaneum]
Group
Gene OntologyGO:00055152.8e-18protein binding
GO:00071542.8e-18cell communication
GO:00350912.8e-18phosphatidylinositol binding
KEGG pathway 
InterPro domain[1-108] IPR0016832.8e-18Phox homologous domain
Orthology groupMCL13049 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215767-TA
ATGCATTTTTCTATCCCTGATTTACAACAATTTCGCGATGACAATGGAATCACTTACACAGGCTACAATGTCTACATCGATGGATTCTTTCACTGCACGGCTCGATACAAACAGCTGCTTAGCCTTCACGAACAACTGCAAGCTCAATATCCGCACTTTAAGTTACCACAGTTTCCACCTAAGAAATTATTTTTAACTAACTCTCAGCTGGAAGAAAGAAGGACTTTGTTAGAAAAATACATTCAATTAATTGGACAAAATCCTGTTTTTGCAAATTCTGGTATACTCATAACATTCCTATTTTCTGCTCAACAAGAAACTCATTCAGTTAGAGTGCATGTAGTGGATATAGAGGTGTCACTGATGAATGGTTATAGAATACCACTGTCGGTGTCCTCAACGGATAGCTCTAGTACAGTCTTAGACATAGCATGCAACTATGTTAACCTGTCAAAAGATCTGACCAAATATTTCTCATTATATCTCTTTAATTGGAGCTGCGCTAAGGACAGACAGCCATGTATCAAGAAATTAGAAGAATATGAATCTCCCTACATATCCCAAAAGTATGTGAGGCCGGAAGACAAAATTGTGTTGCGGAAAAGTTACTGGGATCCATGTTACGACCTAGATTTAATGATAGACAGGGTTTCACTAGATTTATTATATCTCCAACTCATCGAAGAACTTGATCTCGGCTGGATGGTTGCCGATCAGGGGACGAGGGAAATTCTAAGTGATCATGAAGCTAAGAAACAAAAGAGAGAGTACATAGAAATGGCCCGTACGTTGAGGCACTACGGCAGTGTTCCCGCCGGTGAGGCGATTACTGAGGCTATTAACGTCGGTGATAGTAATGGCTCTATCCGAGTAAGGGTGTCTTTGGCTTCCAAGGAACTGACCCTCACCAGCCTCGACACAAGACACGAACAGAGGTACAAAGTCACCAGGATGAGGTGCTGGAGAATAACTACCTTACACACCATGGAACGTCAACAAACGAACGGACACGACTCCCTGATGGACGAGCCGAGTAAAAACTTCGAACTCTCCTTCGAGTATCTCATAAGCAAAGACAATCTCGTTTGGGTTACGCTTAGAACAGAACACGCTATATTCATAAGTGTCTGTTTACAGTCAATCGTGGAGGAGCTGATGCGTCAGAAGAACGGCGAAGGTCCTAAATCCCCTCGTTGTAAGCGAGCGAGTCTGACTTACCTCCGACGTGACGGTTCAACTCACCTCATCACACCGTCATCCTCCAGCGATACTCTCAGCTCTGCGAATGGCGATTCATCTGGCAGTTCGAGGGAATTGTTTTCAGTACAAAAGCTAACAGAAAAATTCGCATCAGTGGCCTTCAAGACGGGCAGAGATTGTGTTGAAAACAACGCGTTTGAGGCCATCGGTGATGAGGAGCTATAA

Protein sequence:

>DPOGS215767-PA
MHFSIPDLQQFRDDNGITYTGYNVYIDGFFHCTARYKQLLSLHEQLQAQYPHFKLPQFPPKKLFLTNSQLEERRTLLEKYIQLIGQNPVFANSGILITFLFSAQQETHSVRVHVVDIEVSLMNGYRIPLSVSSTDSSSTVLDIACNYVNLSKDLTKYFSLYLFNWSCAKDRQPCIKKLEEYESPYISQKYVRPEDKIVLRKSYWDPCYDLDLMIDRVSLDLLYLQLIEELDLGWMVADQGTREILSDHEAKKQKREYIEMARTLRHYGSVPAGEAITEAINVGDSNGSIRVRVSLASKELTLTSLDTRHEQRYKVTRMRCWRITTLHTMERQQTNGHDSLMDEPSKNFELSFEYLISKDNLVWVTLRTEHAIFISVCLQSIVEELMRQKNGEGPKSPRCKRASLTYLRRDGSTHLITPSSSSDTLSSANGDSSGSSRELFSVQKLTEKFASVAFKTGRDCVENNAFEAIGDEEL-