Monarch geneset OGS2.0

DPOGS211476
TranscriptDPOGS211476-TA1287 bp
ProteinDPOGS211476-PA428 aa
Genomic positionDPSCF300113 - 250409-254017
RNAseq coverage618x (Rank: top 21%)
Annotation
HeliconiusHMEL0172833e-5671.05% 
BombyxBGIBMGA007989-TA2e-7369.54% 
DrosophilaCG13830-PC2e-4539.51% 
EBI UniRef50UniRef50_E9G4T21e-5429.45%CG13830-PA-like protein n=1 Tax=Daphnia pulex RepID=E9G4T2_DAPPU
NCBI RefSeqXP_393267.28e-6737.98%PREDICTED: similar to Nucleosome remodeling factor - 38kD CG4634-PA [Apis mellifera]
NCBI nr blastpgi|3838550634e-7237.27%PREDICTED: testican-2-like [Megachile rotundata]
NCBI nr blastxgi|3800155294e-7137.23%PREDICTED: testican-2-like [Apis florea]
Group
Gene OntologyGO:00055097.6e-13calcium ion binding
GO:00071651.2e-10signal transduction
GO:00055781.2e-10proteinaceous extracellular matrix
KEGG pathway 
InterPro domain[328-386] IPR0007169.4e-19Thyroglobulin type-1
[219-327] IPR0119927.6e-13EF-hand-like domain
[231-307] IPR0195771.2e-10SPARC/Testican, calcium-binding domain
Orthology groupMCL12503 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211476-TA
ATGCGGCCTAGGGGGCTGGCAGCTCTGCTGGCTGCCCTGCTGGCTGCGCTGCCTGCCAACGCCGCTCAGAGGAGGATCGACCAGGACTTCGAGTTCGACGATGCCCCGGAGACGGCAGTAACGCACGCGCGTCGTCCTCGGAGATATGTTTACGATCCTGAGAATCCGATGTGTCGCGCCCTCGTGTGCAAGAAGCGCGAGGTGTGTCTGCTGCGCGACTCCTACACCGCGCTTTGCGCCTCCAAGAAAGACGTTCTCAGGAGAGGAGATCAAATAGTTTCGGACGTGTCGAGCGATGACGAGGACGTATTCTACGAGTCTTCCTCGGAGCACCGAGCGGGGTCAAGTCCCGGGTCCGGTCCGGGGTCAGGTCCGGGGTCCGGTCCGGGGCCCGGCCGTTGTGTGGGGTGCGGCGGGGCGGCGCGCGCGGCCTTCCTCTGCGGCTCCGACAACCGCACCTACTCGTCCTTGTGCCGGCTGGACCTCCATAACTGCGTGCACCGCTCGTCGCCGCCCGTGCGCCTCGCCTGCCGCGGCTTCTGTCCCTGCAAGCCCCGAGCGCCGCGCCCTCGACCGCACCGCACCCGCGGCTTCTACGAAGATGAACGAAGACGTCGCCGACTCGACTCTCACAACGAGGTGTACGAGCGCCCTTTCCGCCGTCCGTCTCAAGACGTTGATGGTTGCGCTCTCGATAAGATGGCTAACCGTCTCCTTGACTGGTTCTCCGTCTTGATGGACGAGGCGGGAGGCGTTCCTCCGCCGCAAGAAGGTTTCCCCTCAGGCTGTAAGCCCGAGGTCCGTTGGATGTTCTCCCACCTGGACGCCGGCGGCGACGGGCTGCTGTCTCCGAGCGACTTGTACGCGCTCCGTCACGACGAGCGCGAGCGTTGTCTCCGGCCGTTCCTGTCGTCGTGTGGTCCGGGGGCGCTGTCTCGTTGGTCCTGGTGCGGATGCCTGTCCCGGGCCTCTCGGCCGTGCGCGGCGCTGTCTCGGGCTCACCCCGCTCCTCCTCCGGGCTCGTACGTGCCGTCCTGCGATTCTCGCGGTTGGTACAGACCTCGCCAGTGTCACGCCGCACTCGGCGTGTGTTGGTGTGTGGACGCGCATGGCGTGGAGCTCGCCGGCTCGAGGACTAAGGGAAAGCCTCGGTGCCCCGGTGAAGACGAGGCTGAGGAGACGGAGTCTCGGGGGGGTGAGGTGAGGGAGGGTGACGTGAGGGGAGCACCAGATGACGATGAGGATGCCGGAGGCAGCGGCGACAGGAACAACGAACTGAGGTTCTAA

Protein sequence:

>DPOGS211476-PA
MRPRGLAALLAALLAALPANAAQRRIDQDFEFDDAPETAVTHARRPRRYVYDPENPMCRALVCKKREVCLLRDSYTALCASKKDVLRRGDQIVSDVSSDDEDVFYESSSEHRAGSSPGSGPGSGPGSGPGPGRCVGCGGAARAAFLCGSDNRTYSSLCRLDLHNCVHRSSPPVRLACRGFCPCKPRAPRPRPHRTRGFYEDERRRRRLDSHNEVYERPFRRPSQDVDGCALDKMANRLLDWFSVLMDEAGGVPPPQEGFPSGCKPEVRWMFSHLDAGGDGLLSPSDLYALRHDERERCLRPFLSSCGPGALSRWSWCGCLSRASRPCAALSRAHPAPPPGSYVPSCDSRGWYRPRQCHAALGVCWCVDAHGVELAGSRTKGKPRCPGEDEAEETESRGGEVREGDVRGAPDDDEDAGGSGDRNNELRF-