Monarch geneset OGS2.0

DPOGS214095
TranscriptDPOGS214095-TA2454 bp
ProteinDPOGS214095-PA817 aa
Genomic positionDPSCF300014 - 2153914-2157850
RNAseq coverage46x (Rank: top 71%)
Annotation
HeliconiusHMEL0114216e-15078.25% 
BombyxBGIBMGA006152-TA0.084.20% 
DrosophilaTTLL3A-PA3e-17851.96% 
EBI UniRef50UniRef50_Q17NV10.051.43%Putative uncharacterized protein n=3 Tax=Culicidae RepID=Q17NV1_AEDAE
NCBI RefSeqXP_001648656.10.051.43%hypothetical protein AaeL_AAEL000580 [Aedes aegypti]
NCBI nr blastpgi|1571049730.051.43%hypothetical protein AaeL_AAEL000580 [Aedes aegypti]
NCBI nr blastxgi|1700429490.049.03%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00064641.2e-163protein modification process
GO:00048351.2e-163tubulin-tyrosine ligase activity
KEGG pathwayecb:1000583583e-89 
 K05755 (ARPC4)maps-> Shigellosis
    Pathogenic Escherichia coli infection
    Regulation of actin cytoskeleton
    Bacterial invasion of epithelial cells
    Fc gamma R-mediated phagocytosis
InterPro domain[59-605] IPR0043441.2e-163Tubulin-tyrosine ligase
Orthology groupMCL11592 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214095-TA
ATGGCTGTAACAATGGATTTGGAGGAAAGAGCATCTGATTCTAAGCTGCATAAAGAAAGCTTGGTGAAAAAAGAGTTGCGGCCATCTAGCGCAGTTTCTTTTTCCAAATCTGAACATGGGACAGTGGAAACTTCATCAATAGAGCAATTAAAACAATATAAAAGTTGGGTAAGCAATGAGAGATGGAATGAGCTCAAAAAGATAGCAGACACTGCGATGAAAGAGCGAAAAGTATTTATGATAAAGGGGGGAGGATTCCCTGCCGTGAGACGGGCTCTGTTAGAAAGAGGCTGGGTAGAAAAATATGAATCACATAAGGTACGTCACCCGCCTTCAAACGTAGATCCTAAAAGAGTATCGGGAAAAGAATTGACTAAAGTTGAAAGAATGATCCTTTACAAATTTATGGAACATCATTCTGTGGATTTTTTGTGGACAACTAAAAGAGATAAATATGACTGGCTTCTTAGCAATAAAGAAGTTATTATAAGCAGATTTTGCAGATCCTGTTTCACTACAAAAGAAGGCCTTACGAATTCTTTGACTCAAATGCATTGGTACACTGAACCTGGAGTAGCCTTAACGAAATTCCCGAGGTGCTATAATATCCATAATTCTGATAGTTTGGAAGAGTTTATTGATGATTTTAGAATAACCGCTTGCATAAGTATTCTAAAATGGCTATCCAGTACTCTACAACAAAGAGATGACCAGGAACTTGTAACTAATAATGGAAAAGTTCCATTTTCTGCCATTGAATTCGCAATAAATAGATTAAATGAGTACATTTTATTTTTTACTCATAAAGACATTGATGATACTGAAGATCAGATTCAGCACGTTTGGGAACATGAATGGGACCAATTTCTCACGCATCATTATCTCCTTGTACATGAAAATGCAAAATTTATTGAAGACACAAACTCAAACCTCAAACAATTAGAACGCAAGGCGTCAAAAGTATTGTCAACGATGTCTAAATTCTGGCCCCAAATAGACATAGATGGCGTTTTTAACATTTGGATAGTGAAACCAGGAAATAAATGTCGCGGCAGAGGGATCCAACTCATGAACAACATCAAAGACATCATAGGTCTTATCAATATTCCAGCCCAAAAAACTAGATACGTTGTCCAAAAATATATTGAAAATCCTCTTGTTATTTACGACACAAAATTCGATATACGACAATGGTTTCTAATTACTAATTGCCAACCATTGACGATATGGATATACAAAGATAGCTATTTACGGTTCAGCTCACAAATATTTAGTTTGTCGAACTATCATGAATCAGTACATTTGACGAACAACGCTGTACAAACAAAATATAAAAACAATGGGGACCGGGACAAAGCGTTGCCGGACGAGAATATGTGGGATTGCCACACATTCAAAGCTTATCTTAGACAGATAGGAAAGTATGATATGTGGGATTCGAAAATATATCCTGGAATCAAACAGAGCTTAGTAGGGGCCATGTTGGCGTGTCAGGAGTCCATGGACAAGAGGCAGAATAGCTTCGAGCTGTACGGTGCTGACTTCATGCTGACGGATGACTTTACTCCTTGGCTAATAGAAATCAATTCCAGCCCCGACCTAGCACCCACAACTTCCGTTACTGCTCGCCTTTGTCCACAATGCCTAGAGGACGTTGTCAAAGTTGTGCTAGACAGACGTCTGAACGTAGAAGCGGACACGGGAACGTTTGAGTTAGCCTATAAACAAGTCATACCGAAGGCTCCTGCGTACCTCGGATTATCTTTATGCATCAATGGCAAACGATTGATGCATAGTAAGAAATCCAAAGAGCGGCGGCACGAGCACCGCAGTGTAACGCCGCCAAGTGCGGGCGCGCCGGCTGGGGACGCACAACCGCATGAACAACCCGTGCCTCCCGAATACAATGGACCAATCATCACTGACTTCCTAAGCTGGTTGAACCCTTACGACTCTCTGCCCACCGACAAGGACGGAATACTTCTAGCCACCAAGGACTCCCTCACGGTACGTCGTGCTGTAACCGTTGTAAAGACGACTCGGACGATACAAAATCGTAGCAAAAAGCGTAAACCGTCTGCGATCTCTTCTAGGACATATCGAGGAAGGCGCGAAAGAAACGTAGAATCTAAGCGTCGAATCGTTCCACACTCGTGTTGCCGGCCTGAAGACGGTCAAATAAATAACAACGTCGGTAAATACAGAACGGAATCGGCGAGCAAGGCAGAAAAACCAAAGATTGATAGGTCAGTAGGTAATAAACGCTGCTACATAGACCCTATCGAATGGGAACGTGAGACTGCTAGCATACTCAGGTCAACAATATCCGTGCAAAATAAAATATCTGCTAGAGCGGAAGCTATAAGTAGCGTTAAACAAATTAAATCAGATGACGTTGGATGCCACATACCGAAACCTTCTTTCCCATTCATTCCTAATCCACGACCTCCTTAA

Protein sequence:

>DPOGS214095-PA
MAVTMDLEERASDSKLHKESLVKKELRPSSAVSFSKSEHGTVETSSIEQLKQYKSWVSNERWNELKKIADTAMKERKVFMIKGGGFPAVRRALLERGWVEKYESHKVRHPPSNVDPKRVSGKELTKVERMILYKFMEHHSVDFLWTTKRDKYDWLLSNKEVIISRFCRSCFTTKEGLTNSLTQMHWYTEPGVALTKFPRCYNIHNSDSLEEFIDDFRITACISILKWLSSTLQQRDDQELVTNNGKVPFSAIEFAINRLNEYILFFTHKDIDDTEDQIQHVWEHEWDQFLTHHYLLVHENAKFIEDTNSNLKQLERKASKVLSTMSKFWPQIDIDGVFNIWIVKPGNKCRGRGIQLMNNIKDIIGLINIPAQKTRYVVQKYIENPLVIYDTKFDIRQWFLITNCQPLTIWIYKDSYLRFSSQIFSLSNYHESVHLTNNAVQTKYKNNGDRDKALPDENMWDCHTFKAYLRQIGKYDMWDSKIYPGIKQSLVGAMLACQESMDKRQNSFELYGADFMLTDDFTPWLIEINSSPDLAPTTSVTARLCPQCLEDVVKVVLDRRLNVEADTGTFELAYKQVIPKAPAYLGLSLCINGKRLMHSKKSKERRHEHRSVTPPSAGAPAGDAQPHEQPVPPEYNGPIITDFLSWLNPYDSLPTDKDGILLATKDSLTVRRAVTVVKTTRTIQNRSKKRKPSAISSRTYRGRRERNVESKRRIVPHSCCRPEDGQINNNVGKYRTESASKAEKPKIDRSVGNKRCYIDPIEWERETASILRSTISVQNKISARAEAISSVKQIKSDDVGCHIPKPSFPFIPNPRPP-