Monarch geneset OGS2.0

DPOGS204819
TranscriptDPOGS204819-TA1371 bp
ProteinDPOGS204819-PA456 aa
Genomic positionDPSCF300221 + 63740-75932
RNAseq coverage122x (Rank: top 57%)
Annotation
HeliconiusHMEL0165077e-16570.42% 
BombyxBGIBMGA001559-TA4e-18078.07% 
DrosophilaendoA-PB6e-12582.95% 
EBI UniRef50UniRef50_D6W8662e-16970.09%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W866_TRICA
NCBI RefSeqXP_001655374.16e-16969.14%endophilin a, putative [Aedes aegypti]
NCBI nr blastpgi|2700147457e-16970.09%hypothetical protein TcasGA2_TC004801 [Tribolium castaneum]
NCBI nr blastxgi|3407214873e-16970.40%PREDICTED: endophilin-A-like isoform 2 [Bombus terrestris]
Group
Gene OntologyGO:00055156.6e-79protein binding
GO:00057376.6e-79cytoplasm
KEGG pathwayame:4086237e-137 
 K11247 (SH3GL)maps-> Endocytosis
InterPro domain[5-240] IPR0041486.6e-79BAR
[380-435] IPR0014522.7e-21Src homology-3 domain
[382-402] IPR0001084.6e-06Neutrophil cytosol factor 2 p67phox
Orthology groupMCL11724 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204819-TA
ATGGCGTTCGCTGGACTGAAGAAACAAATAAATAAAGCGAATCAGTATGTCACAGAAAAAATGGGCGGTGCCGAAGGCACGAAGCTTGATTTGGACTTCATGGAAATGGAGAGAAAAACGGACGTAACCTGCGAGCTGGTTGAAGAACTACAAATAAAGACAAAAGAATTTCTACAACCGAATCCAACGGCTAGAGCTAAGATGGCAGCCGTTAAAGGTATCAGCAAACTTAGTGGACAGGCGAAGAGTAACACCTACCCACAACCAGAAGGAGTCCTAGGAGATTGCATGCTGCTCTATGGAAAGAAACTTGGAGAAGACTCGGTATTCTCACAATGTCTGATAGAGATGGGCGAAGCTTTGAAGCAAATGGCTGATGTAAAATATTCCTTGGATGATAACATCAAGCAGAGCTTCCTGGAGCCCCTGCACCACTTGCAGACCAAGGACCTTAAAGAAGTCATGCATCACCGAAAGAAATTACAGGGACGTAGATTGGACTTCGATTGCAAGCGACGAAGACAGGCCAAAGACGATGAGATCAGACAGGCCGAGGAGAAGTTCGCCGAGTCATTGACACTGGCGCAGATTGGCATGTTCAATCTTCTGGATAATGATGTTGAGCAAGTGGCGCAGTTGTCTTTCTTCGCCGAGGGTCTTCTGGAATATCATCAGCAATGTACGGAGATTCTCAAAGGACTGGTGTCTACCCTAATGGAGAAGAAAGAGGAGGCCGTGAACCGTCCCAAAATGGAGTTCGTGCCAAAGACGCTTGCCGACCTTCACATCGAGGGCATCCATGACTTGAACCATGGTAGGCGGTTCGGCTCCACTCCGAGCCTTTCCCGCGCCCAGCCCTGTAGATCCAACTCCTTCGACCTCCTGCCCCCCACCAACGACCCCTTAAAGGCCTGGGAGAACCTCCCCCCTCCCTACCACAACCACTTCAAACCGCACCCCGCACCAAGGATAACTAATGGACGGACAGAGGGCGGATCCCGCGCGAGCTCCCCCGACCACAAGCCGCCGGCCAACCTCGACCTGTTCCCGGCAACTACCCAGCGCTCGAACAACGCGTCTCCGCTTCCGTCGCCGGTCAAGTCTCCAGCGAGGACTCCGATGGTTCAGGCCAAAGGGCCGTGCTGCACCGCCCTATACGACTTCGACCCCGAAAACCAGGGCGAGCTCGGCTTCAAGGAGAATGACGTCATCACCCTCATTTCGAAGGTCGACGAAAACTGGTTCGAGGGTTCCGTGAACGGGAAGACCGGCTACTTCCCGATCAGCTACGTTCAAGTCAACGTTCCGCTCCCCAACATAGCTGACAAATACGACGCGGCCACATTGAGAAGACATTGCTACATCAAATAG

Protein sequence:

>DPOGS204819-PA
MAFAGLKKQINKANQYVTEKMGGAEGTKLDLDFMEMERKTDVTCELVEELQIKTKEFLQPNPTARAKMAAVKGISKLSGQAKSNTYPQPEGVLGDCMLLYGKKLGEDSVFSQCLIEMGEALKQMADVKYSLDDNIKQSFLEPLHHLQTKDLKEVMHHRKKLQGRRLDFDCKRRRQAKDDEIRQAEEKFAESLTLAQIGMFNLLDNDVEQVAQLSFFAEGLLEYHQQCTEILKGLVSTLMEKKEEAVNRPKMEFVPKTLADLHIEGIHDLNHGRRFGSTPSLSRAQPCRSNSFDLLPPTNDPLKAWENLPPPYHNHFKPHPAPRITNGRTEGGSRASSPDHKPPANLDLFPATTQRSNNASPLPSPVKSPARTPMVQAKGPCCTALYDFDPENQGELGFKENDVITLISKVDENWFEGSVNGKTGYFPISYVQVNVPLPNIADKYDAATLRRHCYIK-