Monarch geneset OGS2.0

DPOGS208072
TranscriptDPOGS208072-TA2967 bp
ProteinDPOGS208072-PA988 aa
Genomic positionDPSCF300282 - 7183-16948
RNAseq coverage169x (Rank: top 51%)
Annotation
HeliconiusHMEL0033390.073.66% 
BombyxBGIBMGA007745-TA0.097.47% 
Drosophilag-PB0.059.43% 
EBI UniRef50UniRef50_Q16YQ50.059.25%Apl5 protein (Spac144.06 protein) n=2 Tax=Culicinae RepID=Q16YQ5_AEDAE
NCBI RefSeqXP_001659245.10.059.25%apl5 protein (spac144.06 protein) [Aedes aegypti]
NCBI nr blastpgi|1571189130.059.25%apl5 protein (spac144.06 protein) [Aedes aegypti]
NCBI nr blastxgi|1571189130.055.49%apl5 protein (spac144.06 protein) [Aedes aegypti]
Group
Gene OntologyGO:00085650protein transporter activity
GO:00057940Golgi apparatus
GO:00150310protein transport
GO:00054882.4e-156binding
GO:00068863.7e-125intracellular protein transport
GO:00301173.7e-125membrane coat
GO:00161923.7e-125vesicle-mediated transport
KEGG pathwayaag:AaeL_AAEL0084620.0 
 K12396 (AP3D1)maps-> Lysosome
InterPro domain[1-975] IPR0171050Adaptor protein complex AP-3, delta subunit
[649-677] IPR0119892.4e-156Armadillo-like helical
[32-584] IPR0025533.7e-125Clathrin/coatomer adaptor, adaptin-like, N-terminal
[29-957] IPR0160247.7e-90Armadillo-type fold
[713-970] IPR0104742.7e-22Bovine leukaemia virus receptor
Orthology groupMCL12345 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208072-TA
ATGGCTTTAAAAAAAGTTAAGGGTAACTTCGAAAGAATGTTTGACAAAAATCTTACGGATTTAGTTCGTGGGATAAGAAATAACAAAGATAACGAGGCAAAATATATAGCACAATGTATGGAAGAAATTAAAGTTGAATTGCGGCAAGATAATATCGGGGTGAAAGCTAACGCTGTCGCCAAACTAACATATCTTCAAATGTTAGGATATGACATATCCTGGGCTATATTTAATATAATTGAAGTTATGAGTTCAACTAAGTTTACATATAAACGTATTGGATATTTAGCTGCTAGTCAGTCATTCCATGCAGACTCCGAGCTGCTTATGCTTACCACTAATATGATAAGAAAAGATCTGAATGCTCAAAACCAGTACGAAGCAGGCCTGGCGCTTAGTGGTCTCAGCTGTTTCATATCTCATGATCTGGCGAGAGACTTAGCTAATGATATCATGACATTGATGAGTTCTACGAAACCCTATCTCAGAATGAAGGCGGTGTTGATGATGTACAAAGTATTCTTAAGATATCCCGAAGCTTTGAGACCAGCTTTTCCTAAGTTAAAAGAGAAACTTGAAGATCCCGATCCCGGTGTACAGTCAGCTGCTGTGAATGTTGTGTGTGAATTGGCCCGGAAGAATCCCAAGAATTATCTGTCACTGGCTCCCGTCTTCTTTAAGCTAATGACCACCTCCACTAACAATTGGATGTTGATAAAGATAATAAAACTGTTTGGTGCCTTAACCCCATTAGAGCCTCGACTTGGCAAGAAACTGATAGAACCATTAACTAATTTAATACATAGCACGTCAGCCATGTCTCTGCTGTACGAGTGCATCAACACTGTGATAGCTGTTCTTATCAGCATCAGCAGCGGCATGCCGGGCCATGCAGCATCAGTACAGCTCTGCGTACAGAAACTACGGATACTTATAGAGGACAGCGATCAGAATTTGAAGTATCTGGGTCTGCTGGCTATGTCTCGGATACTGAAGTCTCATCCGAAATCAGTTCAAGCTCACAAGGACCTCGTCCTGGCCTGTTTGGATGATAAGGACGAGTCTATAAGACTAAGAGCTCTCGGCCTGCTGTACGGAATGGTGTCGAAGAAGAATTTGATAGAGATAGTGAAGAAACTAATGGTACACATGGAACGAGCTGAGGGTACGCTGTACAGGGACGAGCTGTTGACCAGGATGATTGAGATCTGCTCCCAGAACAACTACCAGCATGTGGTGCACTTCGAGTGGTACATCACGGTGCTGACGGAACTCACTGAAATGGAGACCAGCGCTAAACACGGTTGTATGATAGCCGGGCAGTTATTGGAGGTGGGGGCCCGGGTGTCGGAGACCCGGGCTTTCGCCGCCCGCGAGTGCTCGTCGCTGGTGACCCGCACCGCCGCAACACAACACGCGCCTCGTGCCGCCTCCAGGGAGGTGTTGTACGCCGCCGCCTATGTACTCAGCGAATACTGCACCGAGGAGCCGGTGATGCGATCTTCCCTGTCTCCTCTGCTGGTGTGTGCCGGTCTCCACGCGGGCCCGTCCAGCGCGCATATGCGGGCCCGGGCGGTGTGTGTGCACGCCGCCCTCAAACTCACCGCCAGACTGCTCCTGTTGTATGAGAACAGAGGCGAGCGCACCGCCGCGCTCTCGGTTATCCACGAGACCCTGGCGGGCATGCAGCCTTTACTCAGCAGCGAGGATATGGAGGTACAGGAGCGAGCCCACAACGCTACAGCACTACTGCGTATAGTGTTGAGGAAGATCAACCCCACGGATCCCGCGCTCGGCAGTGACGTCATCCGCAATGACGTCACCGACACCTTGGTAGAACACGAACCGGAACAGAGCAACGGCGTGGACATCATTGGTGATAGTGACATGAATGGCGGAGACGACGAAGGTTTCAGTGGCGGCCTCATAGCAGAGCTGGCTGGTTTGTTCGAGGGTGAGCTCAAGCCCGTAGCGCCCAAGGCACAGAAGAAAGTACCGATGCCGCCGGACCTGGATCTCAGCGAGTGGCAGTCTTCGTCGCGGTGGTCGTCAGACAGCTCCTCCTCTGAGGCGGAAGAGGACGCTCTGTTCGTCGCACCGCAACCAGAACAAAAACCCGCAACACCCGTCACTACACTACAGTCGTTGCGCGAGGCTCGTCTCCTGGAACAGGCCAACAACCCTCACTACTTGAAGGACGACGGCCGCTACCAGCAGGAGGACGAGGACCCGCCCGTCGCTGAGATCGCTTTAGACGTGCCGCTGCAAATAACCGTTAAGAGATCTGATAAGTACTTAATGTCGAAAGAAAACTCGAAGAAGACGAAGGAAAAGAAGAGGCCGTCTAAGAAACGGAAGAACAAAGTAGAGAGACATTCGTCCGAGTCGGAGAGTGACGACGCGTCAGTGTCCCGACCTACGGTGGCGGAGGGCGGCGAGTTACCGGAGGGTGCTGCGGCCTCCGACGACGAGCCTCCGCCCCGCGACGATCCCCACCGAGCGCTTGACCTTGACCTCGACATGCCATTACGCGAGGAAGAGTTGCTCAGCACACGAACGCGATCGTACCCGCTACCGGAGAGTGGCTTACTAAGTAAGAAAACAGAATCCAAGAAAAACAAATCTACGGAAAAGAAAACCTCCGAAAAATCTACTCACAAAAAGAAATCCAAGAGCTCTAAACGGAACAAAGAAGCCGACCTCATATTACCGGAAACGGAAAATAAGGTCGAGGACATACTGTTGATCGAAACTGAAAACGAATCTAAGAGTAATGAGATCGTTAAAAATGATATTCAGGATGACAAGCCAGAGAAAATTAAGACTGAAAAACATAAAAGGTCAAAGAAAGACACTAAGGAGAAAGATTCGAAGAAAAAGAAGACATCTAAGAAAGGAAAACATGAAACTAAATTAGGTTATGAAGAAGCAATAGGTATTTCAACACCAAGCAAAGAGGTTGTATAG

Protein sequence:

>DPOGS208072-PA
MALKKVKGNFERMFDKNLTDLVRGIRNNKDNEAKYIAQCMEEIKVELRQDNIGVKANAVAKLTYLQMLGYDISWAIFNIIEVMSSTKFTYKRIGYLAASQSFHADSELLMLTTNMIRKDLNAQNQYEAGLALSGLSCFISHDLARDLANDIMTLMSSTKPYLRMKAVLMMYKVFLRYPEALRPAFPKLKEKLEDPDPGVQSAAVNVVCELARKNPKNYLSLAPVFFKLMTTSTNNWMLIKIIKLFGALTPLEPRLGKKLIEPLTNLIHSTSAMSLLYECINTVIAVLISISSGMPGHAASVQLCVQKLRILIEDSDQNLKYLGLLAMSRILKSHPKSVQAHKDLVLACLDDKDESIRLRALGLLYGMVSKKNLIEIVKKLMVHMERAEGTLYRDELLTRMIEICSQNNYQHVVHFEWYITVLTELTEMETSAKHGCMIAGQLLEVGARVSETRAFAARECSSLVTRTAATQHAPRAASREVLYAAAYVLSEYCTEEPVMRSSLSPLLVCAGLHAGPSSAHMRARAVCVHAALKLTARLLLLYENRGERTAALSVIHETLAGMQPLLSSEDMEVQERAHNATALLRIVLRKINPTDPALGSDVIRNDVTDTLVEHEPEQSNGVDIIGDSDMNGGDDEGFSGGLIAELAGLFEGELKPVAPKAQKKVPMPPDLDLSEWQSSSRWSSDSSSSEAEEDALFVAPQPEQKPATPVTTLQSLREARLLEQANNPHYLKDDGRYQQEDEDPPVAEIALDVPLQITVKRSDKYLMSKENSKKTKEKKRPSKKRKNKVERHSSESESDDASVSRPTVAEGGELPEGAAASDDEPPPRDDPHRALDLDLDMPLREEELLSTRTRSYPLPESGLLSKKTESKKNKSTEKKTSEKSTHKKKSKSSKRNKEADLILPETENKVEDILLIETENESKSNEIVKNDIQDDKPEKIKTEKHKRSKKDTKEKDSKKKKTSKKGKHETKLGYEEAIGISTPSKEVV-