Monarch geneset OGS2.0

DPOGS202703
Transcript  DPOGS202703-TA  1785 bp
Protein  DPOGS202703-PA  594 aa
Genomic position  DPSCF300324 + 105994-110844
RNAseq coverage  42x (Rank: top 72%)
Annotation
Heliconius  HMEL012364  2e-139  59.80%
Bombyx  BGIBMGA004815-TA  0.0  82.80%
Drosophila  Bili-PF  5e-105  47.26%
EBI UniRef50  UniRef50_E0VYG0  5e-111  51.84%  Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VYG0_PEDHC
NCBI RefSeq  XP_969515.1  8e-116  48.55%  PREDICTED: similar to GH28553p [Tribolium castaneum]
NCBI nr blastp  gi|91093987  2e-114  48.55%  PREDICTED: similar to GH28553p [Tribolium castaneum]
NCBI nr blastx  gi|91093987  7e-110  48.55%  PREDICTED: similar to GH28553p [Tribolium castaneum]
Group
Gene Ontology  GO:0005488  7.3e-15  binding
KEGG pathway  ssc:100156660  2e-09
    K06271 (TLN)  maps-> Focal adhesion
InterPro domain  [275-403]  IPR019748  5.5e-22  FERM central domain
    [278-400]  IPR014352  7.3e-15  FERM/acyl-CoA-binding protein, 3-helical bundle
Orthology group  MCL13070  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202703-TA
ATGGATCATAAAACAGATATGGGTGCTTATGGAGAAAATTCGAGTGTACAAAATAGCGGAAACTATATAACTATTATACCCGTGGAATATTCCCGTCGAGGATATCCAATTCAAACGGAATATCAATATAATGAATTCCGATCGCGGGAGGAGTATTACAGTGTTCAAAGTCAGAAATTACAGGCGGAGGTGAACGCGCATTTGTACTCCGTTTCTCAACGTCTTCCGCACTTATCCTACAGCCCGCTTGGAAGACATAATGACTGGGACAAACATGATTCTCACAAACCGCTACCAGACTCTAAAGAGAGTAATTCTTCAGACTCTCTGGAGAAAAGTGTTATGGGGTCGAATCTCAGCAGCTGTTCCAATCAGACCGGGTCCAGTATAGCGGCAGGTTCTTTTAGCACCACTGAACCCATTACCAGCGGGGGTAGTTCAGCTAGTGCCATATCACCGCCCCTGCCTGGTTCATCAGGGTCATCGACCAGTAGCGCTTCAAACCTGGTAACCTGCGTGTATCTGATGTCGAGGACCGCGGTCATGGTGGAGATGAGCTCGGAGGAGGTTCCATGGTGCGTGACCGCCGGCAGGTTGCTTGCGGCTGTACTTTCAGCTGAGGAGTTGGGTCTTGCTGCACCAGCCAGGAGCCTCGCCGCCAGTGTTTTCTCATTGTGGATGTGCAGCCCGCTACTGGAGATTCAGCTCAAGCCGCACCACTGTCCCTGGCGGGTGTGGGCGGCCTGGCAGCAGCTACTAGTGCGCTACGGACACGGATCTCCGTCCAGACGAGCCAAGGACCAGCCCGTGCTCAGGATGCAGAGAAACGTTTTCTTCCCAAAACATCTAGAGGAAGGAATAAAAGATTCAAAAATTCAAGAGTTATTGTACGAAGAAGCGAGGTACAACGTGGTTAATGGAAGGTATCCGCTGGAGAGCGCTCAAGCGGTGATGTTAGGCGGACTGCAGGCTCGGATTCAGCTCGGTCCATATGACCCTCATCGTCACACCGCTAAATTTTTCAGAGAGCACCAGCGCGAGTATCTACCAGCGCACGTGCGTCGTGGCGGCTGGGCGCGTTTAGTGCCGGCGGGTCGGAAGGGCAGCCCTGAGGCACGTCTCTTAGAGCACGCACAGCGACCTCCAGCCGCCCCGCCCCGGAAACTGAGACACAAATACCTGACACACACCCGGACACTACCCACTTACGGGGCAGCGTTCTTCCAAGGTCAGATAGAGCAGCCTCTCCGTAGCCTGACCAGTCTGCTGACACACGAGGACATCCCGGTGCTGGTCGCTATCAACGCCTGTGGGGTCTACGTTATTGATGACACGGAGAGTACGGTACTCCTCGGTCTACTCTATGAGGAGCTGTCATGGGATATCGGTCTACCGTCAGATGACAACGAGGATTGTCTGCCGTGCCTCTTTCTCCAATTCATGGTCGTGGAGAATGGACTGCGAGTGTCGAAAATACTACAGGTATTTTCAAAGCAGGCGATCATGATGGACACTCTTATAGAACACTTCGCTGGGGAATACAGAAGGAGAATCGGTCAAGACACCCCGAGCGAACACGCCAACTATGACTACCACTCAGATTCTGGAAGCATTTCTCTGCCTCCCCTCTCCCGACCGGACTCCCCCCAGAGGCGTCTCGCCAACAAGCTCAGTCGACTGGCGCTCGCCACACACGACGGACGCGGCAACCTGCTGGGGGGCGCCGGGGACTGGAACAGCGGCCTTCACCACCGACAGCCCTCCTGGATCCTACCCAAACATTAG

Protein sequence:

>DPOGS202703-PA
MDHKTDMGAYGENSSVQNSGNYITIIPVEYSRRGYPIQTEYQYNEFRSREEYYSVQSQKLQAEVNAHLYSVSQRLPHLSYSPLGRHNDWDKHDSHKPLPDSKESNSSDSLEKSVMGSNLSSCSNQTGSSIAAGSFSTTEPITSGGSSASAISPPLPGSSGSSTSSASNLVTCVYLMSRTAVMVEMSSEEVPWCVTAGRLLAAVLSAEELGLAAPARSLAASVFSLWMCSPLLEIQLKPHHCPWRVWAAWQQLLVRYGHGSPSRRAKDQPVLRMQRNVFFPKHLEEGIKDSKIQELLYEEARYNVVNGRYPLESAQAVMLGGLQARIQLGPYDPHRHTAKFFREHQREYLPAHVRRGGWARLVPAGRKGSPEARLLEHAQRPPAAPPRKLRHKYLTHTRTLPTYGAAFFQGQIEQPLRSLTSLLTHEDIPVLVAINACGVYVIDDTESTVLLGLLYEELSWDIGLPSDDNEDCLPCLFLQFMVVENGLRVSKILQVFSKQAIMMDTLIEHFAGEYRRRIGQDTPSEHANYDYHSDSGSISLPPLSRPDSPQRRLANKLSRLALATHDGRGNLLGGAGDWNSGLHHRQPSWILPKH-
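
The transcript and protein lengths listed in this record (1785 bp and 594 aa) can be cross-checked against each other. The sketch below is a minimal, hypothetical sanity check, not part of the original record. It assumes the transcript is a complete coding sequence under the standard genetic code: it starts with ATG, ends with a stop codon, and has a length divisible by three. It also assumes the trailing "-" in the protein sequence marks the stop codon rather than a residue. The function names are illustrative.

```python
# Stop codons of the standard genetic code (assumed to apply here).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def expected_protein_length(cds_bp: int) -> int:
    """Return the residue count for a CDS of cds_bp bases.

    Assumes the CDS length includes the terminal stop codon,
    which does not contribute an amino acid.
    """
    if cds_bp < 6 or cds_bp % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3 and at least 6 bp")
    return cds_bp // 3 - 1


def looks_like_complete_cds(seq: str) -> bool:
    """Check start codon, stop codon, and reading-frame length."""
    seq = seq.upper()
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Lengths taken from the record: 1785 bp transcript, 594 aa protein.
print(expected_protein_length(1785))
```

To apply `looks_like_complete_cds` to this record, pass it the nucleotide sequence under `>DPOGS202703-TA`. When comparing against the protein entry, strip the trailing "-" before counting residues.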