Monarch geneset OGS2.0

DPOGS203969
TranscriptDPOGS203969-TA1809 bp
ProteinDPOGS203969-PA602 aa
Genomic positionDPSCF300005 + 674059-689141
RNAseq coverage252x (Rank: top 41%)
Annotation
HeliconiusHMEL0103692e-15381.31% 
BombyxBGIBMGA000730-TA2e-1282.50% 
Drosophilamagu-PC7e-8932.64% 
EBI UniRef50UniRef50_UPI000224627F5e-10940.42%UPI000224627F related cluster n=1 Tax=unknown RepID=UPI000224627F
NCBI RefSeqXP_001687912.16e-11740.34%AGAP007489-PB [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3407169137e-12341.69%PREDICTED: LOW QUALITY PROTEIN: SPARC-related modular calcium-binding protein 1-like [Bombus terrestris]
NCBI nr blastxgi|3504205968e-12543.40%PREDICTED: SPARC-related modular calcium-binding protein 1-like [Bombus impatiens]
Group
Gene OntologyGO:00055094.2e-34calcium ion binding
GO:00071651.3e-11signal transduction
GO:00055781.3e-11proteinaceous extracellular matrix
GO:00055157.5e-09protein binding
KEGG pathway 
InterPro domain[405-571] IPR0119924.2e-34EF-hand-like domain
[381-450] IPR0007163.6e-18Thyroglobulin type-1
[501-565] IPR0195771.3e-11SPARC/Testican, calcium-binding domain
[34-85] IPR0023507.5e-09Proteinase inhibitor I1, Kazal
[51-85] IPR0114974.6e-08Protease inhibitor, Kazal-type
Orthology groupMCL12606 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203969-TA
ATGATTATGAATGACTTAGTGTTCCTCATTTTTTGTTTAAATTATATTTGTTACGTTAGTGGTGCTGATTCTGGCGAGAAGCCCAATGCGCAAAGCGAGACCTGTTACCATCGCGTGGCGGCGTGTGAAGCAAACACGGGTGCCGTGAATCGTCCAGTCTGCGGTTCCGACGGACATAACTACCCTTCAAAATGTCACTTAATGAAGGCACAGTGCTCAGGAGAACCTATTGTAATGGCCCACAGAGGGCCCTGTACAGACAGTCAAACTTCATGTATGGCGGTGCTGCGTTATGCATTGAAGCAAGGCGGTCGTCGTGCCACATTTGTGCCAAGGTGCCGCGCGGACGGCACTTATGCCGCCGTGCAATGTGCCGCTGCAGGTGCCGCAGCTGGCTGTTGGTGTGTCACCGCCGACGGGAAACCTCTGCCCGATACAGCTGTGAGGAATGGAAGGCCAGATTGTACGAGAACTGGCATTGATGTCTGTTTTATATGGCGACGAAATGTTATAAGGATAGAATTCATGACTAGTTTGAAGAACTGCGGGATGGCCGTCCCTTACAAGGTAGCCCAACCCCACGTCTGGTTCTTCTGTAAATCTCAAACAAAGCGGCGCTCTTCCGTTCGAGGTCAACGTAATAAGAAAAGTTGTACCAGAGTAGACAGAGCACAGTTCAATGGAAATCTTATCAAAATATTCAGTGGAGAATACGACCGAGCCCGAGCTGATGATGGAGGGGCCTCGGATCCTCGAGGAGTCGCTGATTGGAAATTCAGGGAACTGGATCGTGATAGAAGTGGGACGCTGCAGAAGTCTGAGTATCGCGGCTTGCGGCGGCTCATCAAAAAGGTGGTGAAACCAAAACGATGCGCTCGCGCATGGGCCCGCGGTTGTGACGGCGACGGGGACGGGGAGATCGCGCGCTCGGAGTGGGCCGCATGTCTCTTGGCCAGCCCGGACCCACCCGCTCCGGACTTCTCTCTCCGTTTCTTCATGTCGTTGAATGCAGACGACGATAGTGTTCCAGAGCCCGAACCGGACTACGAAGAGGAACCACCTCCAGACCCCAGTTCAGTATTGCCTGGCATAATGCGGAATTCCTTCGCTCCAGACGGTTCTGTCGTTAGAGAAGATGAAACAAACGACTGTCTCACAGACCGACAGGCCGTGCTAGATGAACAGAAAGCTGGCAGTGCTGTTTTATACGTGCCAGAGTGTACTGGTGACGGTCGGTATGCGCGCGCGCAGTGTTACCGCTCCACCGGCTACTGCTGGTGCGTCCATCAAGACACTGGCAAACCGATACCGGGATCGTCGGTCAAAGACGCTAAGCCGGACTGCGACGCCGCTCCACAACACGCCAGCCCAATGAGAGGTTGCCCAGAACCAATGAAGAGTCATTTTCTCCATGACCTGATAAGTTTCTTCATATCAAAGATGACTACTTCTATCAACGGCACGGGTCCAGGAGATGTGGTGAAATGGGGGGCGTCGAAGGAGGAGCAGGCAGCTACTTGGACCTATGTTATGTTAGATAAAGACAAAAACAAAGCCTTGGAAAGACGGGAGTGGAAAGCTTTCCACCAGCTGATATCAAACATGGAGCCATTGAGAAGATGTGGAAGAAAACTCCCTCGTTACTGTGACGTAAACCATGATTCCAAGATTAGTATTACAGAATGGATGGCCTGCTTGGAGGTCACACAGGCAGCGCACGGGCATACCACTGAAACAACAAAAGTTCCATCTAATCCAAGAAGAAAAGGACCCAATCCTCTCGAATCGATTCTAAAGGCCGACGACTAG

Protein sequence:

>DPOGS203969-PA
MIMNDLVFLIFCLNYICYVSGADSGEKPNAQSETCYHRVAACEANTGAVNRPVCGSDGHNYPSKCHLMKAQCSGEPIVMAHRGPCTDSQTSCMAVLRYALKQGGRRATFVPRCRADGTYAAVQCAAAGAAAGCWCVTADGKPLPDTAVRNGRPDCTRTGIDVCFIWRRNVIRIEFMTSLKNCGMAVPYKVAQPHVWFFCKSQTKRRSSVRGQRNKKSCTRVDRAQFNGNLIKIFSGEYDRARADDGGASDPRGVADWKFRELDRDRSGTLQKSEYRGLRRLIKKVVKPKRCARAWARGCDGDGDGEIARSEWAACLLASPDPPAPDFSLRFFMSLNADDDSVPEPEPDYEEEPPPDPSSVLPGIMRNSFAPDGSVVREDETNDCLTDRQAVLDEQKAGSAVLYVPECTGDGRYARAQCYRSTGYCWCVHQDTGKPIPGSSVKDAKPDCDAAPQHASPMRGCPEPMKSHFLHDLISFFISKMTTSINGTGPGDVVKWGASKEEQAATWTYVMLDKDKNKALERREWKAFHQLISNMEPLRRCGRKLPRYCDVNHDSKISITEWMACLEVTQAAHGHTTETTKVPSNPRRKGPNPLESILKADD-