Monarch geneset OGS2.0

DPOGS206111
TranscriptDPOGS206111-TA1236 bp
ProteinDPOGS206111-PA411 aa
Genomic positionDPSCF300028 + 565771-567006
RNAseq coverage291x (Rank: top 38%)
Annotation
HeliconiusHMEL0140640.097.81% 
BombyxBGIBMGA006832-TA0.092.21% 
DrosophilaTBPH-PC3e-11165.12% 
EBI UniRef50UniRef50_Q131482e-9959.63%TAR DNA-binding protein 43 n=68 Tax=Coelomata RepID=TADBP_HUMAN
NCBI RefSeqXP_392590.32e-13459.48%PREDICTED: similar to TBPH CG10327-PA, isoform A [Apis mellifera]
NCBI nr blastpgi|3287838093e-13559.72%PREDICTED: TAR DNA-binding protein 43-like [Apis mellifera]
NCBI nr blastxgi|3287838095e-14058.11%PREDICTED: TAR DNA-binding protein 43-like [Apis mellifera]
Group
Gene OntologyGO:00001662.4e-19nucleotide binding
GO:00036766.6e-17nucleic acid binding
KEGG pathway 
InterPro domain[105-174] IPR0126772.4e-19Nucleotide-binding, alpha-beta plait
[106-177] IPR0005046.6e-17RNA recognition motif domain
Orthology groupMCL11120 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206111-TA
ATGTCCTTCGAGTACTTGCCCGTGGCCGAAGATGAAAATGAAGAACCCATAGAACTTCCAATTGAAGAAGATGGGACTTTGATGTTAACAACTGTATCCGCGCAGTTTCCTGGCTGTTGTGGACTCAAGTATCGTCATCCTGAAACGAAAACGTTTAGAGGTATAAGATTACGAGATGGTAGGCTTTACCCACCACCAGAAGGTTGGGGAAATCAACTGTACATATGCAGTTTTCCGAAAGAAAATAAACGAAAATCTGGTGACAATTCTGAAACATCTTCGGTGAAAAGTAAACGGAATGATAATTTGTGCTCAGATTTAATAGTGTTGGGGTTACCATGGAAAGCAACAGAGCAAACCGTGCGGGAGTATTTCGAAAAGTTTGGCGAAGTGTTAATGGCTCAATTAAAACGTGATCCTAAAACCGGTATGTCAAAAGGCTTCGCTTTTATTAGATTTTCATCTTACACATCTCAGATGAGAGTCTTAGCTCAAAGACATATGATTGATGGACGTTGGTGTGATGTACGGATACCTAACTCAAAGGAAGGTTCTGTCACATCTATGCCTTGTAAAGTTTTTGTTGGCCGCTGTACAGAAGATTTAACAGCCAATGATTTAAGAGAATATTTTTCACAATTTGGTGAAGTAACAGATGTTTTTATTCCAAAGCCTTTTAGGGCATTCAGCTTTATAACATTTTTGGATCCTGAAGTTGCACAAAGCTTATGTGGTCAAGACCACATTATAAAAGGAGTATCTGTAAATGTGTCTAATGCATCACCTAAACAAAATAAAAGTGGTTCTAATCAACGAAACTTACCAAGTAGAAACTATGAAGAAGGACATCCACACAGTGCCTCAAACAATAATTCATGGAGTAGCCGTAATATGGATATGGTGAATATGCAAGCCTTAGGATTGTCTGGCCAACACGGTCAAACCGCCGTGGCCGGTGGTGGAGGGCAAGGCCAAGGTGGAAGTATGCCACTCGGCATGGGTGGTTTGCCAGTAAATCAAGCTCTAGTAGCTGCTGCACTAAATCAGGCAGCAGGCTGGGGTTTAATTAATAATATACCATCGGGGGGATCAGATCAAGGTGCCTTTGCTGGACCGGCTTCTTCTGCTCCACCAGCACCACCTAACTTCCTGTCATGGATGCAACAGGGCAATTCTGGACAAGGACCTTCTAGTCAGTGGGGACAGAGACACCAATCCCAAGGCCACTCCGTTTGA

Protein sequence:

>DPOGS206111-PA
MSFEYLPVAEDENEEPIELPIEEDGTLMLTTVSAQFPGCCGLKYRHPETKTFRGIRLRDGRLYPPPEGWGNQLYICSFPKENKRKSGDNSETSSVKSKRNDNLCSDLIVLGLPWKATEQTVREYFEKFGEVLMAQLKRDPKTGMSKGFAFIRFSSYTSQMRVLAQRHMIDGRWCDVRIPNSKEGSVTSMPCKVFVGRCTEDLTANDLREYFSQFGEVTDVFIPKPFRAFSFITFLDPEVAQSLCGQDHIIKGVSVNVSNASPKQNKSGSNQRNLPSRNYEEGHPHSASNNNSWSSRNMDMVNMQALGLSGQHGQTAVAGGGGQGQGGSMPLGMGGLPVNQALVAAALNQAAGWGLINNIPSGGSDQGAFAGPASSAPPAPPNFLSWMQQGNSGQGPSSQWGQRHQSQGHSV-