Monarch geneset OGS2.0

DPOGS208125
TranscriptDPOGS208125-TA1854 bp
ProteinDPOGS208125-PA617 aa
Genomic positionDPSCF300154 + 36102-40415
RNAseq coverage168x (Rank: top 51%)
Annotation
HeliconiusHMEL0122740.088.57% 
BombyxBGIBMGA006567-TA0.085.64% 
DrosophilaCG2258-PB2e-9041.99% 
EBI UniRef50UniRef50_E2A7B51e-15745.95%Transcription initiation factor TFIID subunit 5 n=25 Tax=Eumetazoa RepID=E2A7B5_CAMFO
NCBI RefSeqXP_001604497.13e-16046.96%PREDICTED: similar to ENSANGP00000012709 [Nasonia vitripennis]
NCBI nr blastpgi|3320232101e-16047.28%NCK-interacting protein with SH3 domain [Acromyrmex echinatior]
NCBI nr blastxgi|3320232102e-15747.13%NCK-interacting protein with SH3 domain [Acromyrmex echinatior]
Group
Gene OntologyGO:00055153.1e-13protein binding
KEGG pathway 
InterPro domain[446-584] IPR0185563.2e-34Domain of unknown function DUF2013
[10-73] IPR0014523.1e-13Src homology-3 domain
Orthology groupMCL13650 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208125-TA
ATGACTGAAGGACAACTCTTAACTGATGATTTAGAAATGCTCACAGCATTATATGATTTCGAGGCAACCTTAGCCAAGACTCTGAGTTTTACGGAAGGCGATTTTTTTTATCTTCAACAAAGTAACACTAAGCAAAGAAATTGGTGGCATGTTGTTAATAGGAAAGGTCAAGTGGGCTTTGTGCCTTCAAACTACGTAGCTTCAGTCAAGGTCGAGCCAGATTATTTTCTTTCATTCTTGAATGACTGCATAAAGAGTTTAAATGAAACCAATGCTATGACACCTCAGAAGCAAGAGACTTTGACTAAACTGTATGAAAAGAAGAAGCAACTCCAAGCTCTTGTGAAGCCCAGTAAAAAGCAAGCTGCTCCAAAACCTCCTCCCCGGCTAGATGACAACACACCTCCAAATGGTGTATCACCATCATTTGATCTGCGTCCTGTTGTGTCACAACAGCATATTTCTGAAGAAAAATCATGCTCATCGCCATCTAAATCATTGAACAGCAATACATATAAAGATAATGATGAAGATAAGAAGGGCTATTCCAAATCTGCTTCTAATCAGTTGTCGTCGGACACAGAAAACCAGGATGACAGTCAAGACAGCAGTGAAAGCATTAAGCCAAATGCTATTTATGAAATTGTTCAAGCTGTGAGAAAAGAAACTCAATTAAGCCATGAAATGTCCAAAGTTGCTGTGGAAACTGTTCTGATCTCCTTAAGGGAATTTCTCCCCGGCGGTGCTGCAAGGTCCATAATAGATGCGCTGCTACGAGAAGCAAACAGCAATATAACATGTCCAAAGAATGCTATCGATGCTGCACCAGATGCGCTACGAATGATGACAGCACTGAATGCACTATCGAAGGCAGCGAATGATGCACAGCAACGAGGCTGGGCTTTACATGACGATGCCCATGATATACAGACACAGTTGTTGGAACTTATCTCAGTCATGTCGAATGCTGATGTCAATATATCCCAGCATGTGCTTAGTAGCCATCGTTATGCATATGTGACAACCCTTGTGCAGTACTACCAAATGGAAACACGTTGGCCGCTTAGACAGCTGCTGTTGCAAGCATTTGGTGTAATGTGCGGTTTGGAACGCACAGCGCTGGCAACCCTGGCCTTATCAGCTCTGCCAGCGGAAATAGCTCGTGACATGCGTGACAACCCTCGAGCTGTCAGCCGTCTGTCGCACTCAGCGCTACTGTTGTCTATGGTGTTGTCTATGGGGGACAAACTTCCTATCACGCACTTCGAACAATTAGGAGTTGACTTTGCACAATTTCTGCTTGAGCTTATAGAAAATCCTCCGGAGACAGACGTTGATGAACAAATACCCGATTTATTCCTAACATTACTCCTCGCATATAATCTTCAATTCGAGAGTCCTTTCGAAAATCTTCTGCTAAACGCTTTAGAGGTGATGAATAACGCTAAAACATTTTGTGAAAAAGTTCTCCTGTTGCTCAATCGAGAGGAAGATCCTGTCCACATATTCGATCACGAACCAGCCCCAGCCCACTCCGTGTTAAAGTTGGTTATCGATTTGTTCAGTCGCAAGAAGACTGCCGAACATTTCTACACCAACGACGTTAAAGTTGCCATCGATATAGTCGTTAGGCAACTAGCAGATCTGTCGCCCGGCGATTTGCGTCGTCAGCAATACCTGAAGATACTCCAAGGTATAATCCGCAATACAGACTACGGAGCGCATTTACACCGGAGAGATGATCTCCTTCGATGCTTTGCGAGAATATTCTGCGAAGAAGGTGATATAAGCCGAGACGATCAGACACTAGTAAGAGCCATATCTAACGAATTCCCACAGTACTTTAAAGCCTAG

Protein sequence:

>DPOGS208125-PA
MTEGQLLTDDLEMLTALYDFEATLAKTLSFTEGDFFYLQQSNTKQRNWWHVVNRKGQVGFVPSNYVASVKVEPDYFLSFLNDCIKSLNETNAMTPQKQETLTKLYEKKKQLQALVKPSKKQAAPKPPPRLDDNTPPNGVSPSFDLRPVVSQQHISEEKSCSSPSKSLNSNTYKDNDEDKKGYSKSASNQLSSDTENQDDSQDSSESIKPNAIYEIVQAVRKETQLSHEMSKVAVETVLISLREFLPGGAARSIIDALLREANSNITCPKNAIDAAPDALRMMTALNALSKAANDAQQRGWALHDDAHDIQTQLLELISVMSNADVNISQHVLSSHRYAYVTTLVQYYQMETRWPLRQLLLQAFGVMCGLERTALATLALSALPAEIARDMRDNPRAVSRLSHSALLLSMVLSMGDKLPITHFEQLGVDFAQFLLELIENPPETDVDEQIPDLFLTLLLAYNLQFESPFENLLLNALEVMNNAKTFCEKVLLLLNREEDPVHIFDHEPAPAHSVLKLVIDLFSRKKTAEHFYTNDVKVAIDIVVRQLADLSPGDLRRQQYLKILQGIIRNTDYGAHLHRRDDLLRCFARIFCEEGDISRDDQTLVRAISNEFPQYFKA-