Monarch geneset OGS2.0

DPOGS215400
TranscriptDPOGS215400-TA1299 bp
ProteinDPOGS215400-PA432 aa
Genomic positionDPSCF300088 + 295157-301495
RNAseq coverage242x (Rank: top 43%)
Annotation
HeliconiusHMEL0036734e-14866.89% 
BombyxBGIBMGA012488-TA7e-0754.29% 
DrosophilaCG7526-PD5e-0932.50% 
EBI UniRef50UniRef50_D6WCR52e-6339.29%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WCR5_TRICA
NCBI RefSeqXP_973054.14e-6439.29%PREDICTED: similar to collagen and calcium binding EGF domains 1 [Tribolium castaneum]
NCBI nr blastpgi|910764987e-6339.29%PREDICTED: similar to collagen and calcium binding EGF domains 1 [Tribolium castaneum]
NCBI nr blastxgi|910764985e-6938.63%PREDICTED: similar to collagen and calcium binding EGF domains 1 [Tribolium castaneum]
Group
Gene OntologyGO:00055091.5e-08calcium ion binding
KEGG pathway 
InterPro domain[138-178] IPR0018811.5e-08EGF-like calcium-binding
[261-292] IPR0081605.2e-06Collagen triple helix repeat
Orthology groupMCL22710 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215400-TA
ATGTCGGCCGCAGTCACAATGCGCGCGCCGCTCGCACCGGCCTCGCTCCTCCTGACACATGCGCTCCTGGTACTCGGGCGACTACATGAAGGATACTACGCCGAGGACGGTTACCATGACGACGCCCTGGATGTGGTGGACCTAGTCTCGGCCTGTCCTACAGACAGACTGCTTCGTACCAGAGAGACTTGTCACGTGGAAGGGGCTGACGTTCAGTGTATCACATTACACTGCTGTGAAGACTACAGTTACATCGCTGGACGCTGTATCCGTAACTCTGTGGACGCGTGTAGTCTCCACCTGTGTGAGCAGGCTTGTGAGGTTCAGGAGCAGCGTGTGTGGTGTTCCTGTCACCCTGGGTACAGGTTCGACGCTGATAGTTACAATCGGAAAAGGCAGCCTTACTGTGTAGATATAGATGAATGCACCATCAATAATGGCGGCTGTGAGCACCGTTGTGTGAACGACCCCGGCGGTTTTCACTGTGAGTGTAACGCGCCGTATAGTGTCGGCATCGATGGAAGAAAGTGTGTACCGTCTGTGGCTGTCGGGATGCCGGAACCTTTGCCCCTCGTCCGAACATCTTCTCGGTGCTACGCTCCGTGTGACACCGTGTCCTGGCTCTCGCGGAAGGTGAAGCAGCTCAACGACCAGCTCCACAGCACGCAAGCTGCCTTGAAGAAGTTGTTAGAGAACCCCGTGCTGACAGAAGACAGGAGTTTTGCTTATCGAGTACTGGATTCCACGGCTCCCTTAGAGGGCGGCTACTGCCGGTGTGAGAGAGGTCCTCGTGGTCCCGCCGGCCCACCGGGGATGGAAGGCCCGAAAGGCGACCCGGGACAACGCGGACCCCGAGGAGCTCGAGGTCCCAAGGGATCTTTGGACCTTATGCTGCTTTTACTAGCAGACATAAGACACGACATCCATAATCTTGAGGAAAGGGTTTATAAAGAAGGGGAACGACCCGAACGCTTCAACCTTCAGAAGGCATGGCGTCGACAACGGAAGCAAGAAAACTTAGAGAAGGAAAATAGGACGGAAGAAGAACTAGAAGCTTACACCTCGCCACCCGTCATTGAGGGGGCTGGAGACGTGACATCACGAGGTCCCGATGGTGACAAGCCCGAGTCCGGCACCACCAGGGATAATGTCCATAATGAGGATAATCAGAAGAGCACGGAATCCTTGGACCTTGCGGACATGGACGAGAAGCTGCGGCAGATCAGACTCCTGGCGCAGTCCACCAGCACCGACGACGACGACGAGCCCGACGGAGACTACGACTACAGCTTCTACTAG

Protein sequence:

>DPOGS215400-PA
MSAAVTMRAPLAPASLLLTHALLVLGRLHEGYYAEDGYHDDALDVVDLVSACPTDRLLRTRETCHVEGADVQCITLHCCEDYSYIAGRCIRNSVDACSLHLCEQACEVQEQRVWCSCHPGYRFDADSYNRKRQPYCVDIDECTINNGGCEHRCVNDPGGFHCECNAPYSVGIDGRKCVPSVAVGMPEPLPLVRTSSRCYAPCDTVSWLSRKVKQLNDQLHSTQAALKKLLENPVLTEDRSFAYRVLDSTAPLEGGYCRCERGPRGPAGPPGMEGPKGDPGQRGPRGARGPKGSLDLMLLLLADIRHDIHNLEERVYKEGERPERFNLQKAWRRQRKQENLEKENRTEEELEAYTSPPVIEGAGDVTSRGPDGDKPESGTTRDNVHNEDNQKSTESLDLADMDEKLRQIRLLAQSTSTDDDDEPDGDYDYSFY-