Monarch geneset OGS2.0

DPOGS206045
Transcript         DPOGS206045-TA   1338 bp
Protein            DPOGS206045-PA   445 aa
Genomic position   DPSCF300028 - 1256035-1260539
RNAseq coverage    17x (Rank: top 81%)
Annotation
Heliconius         HMEL008777         0.0      67.86%
Bombyx             BGIBMGA000717-TA   1e-143   56.03%
Drosophila         CG1791-PA          4e-50    36.69%
EBI UniRef50       UniRef50_D6WEJ6    2e-66    54.38%   Angiopoietin-like 1 n=3 Tax=Tribolium castaneum RepID=D6WEJ6_TRICA
NCBI RefSeq        XP_973843.1        3e-67    54.38%   PREDICTED: similar to AGAP004918-PA [Tribolium castaneum]
NCBI nr blastp     gi|91078356        6e-66    54.38%   PREDICTED: similar to AGAP004918-PA [Tribolium castaneum]
NCBI nr blastx     gi|270004866       1e-65    54.38%   angiopoietin-like 1 precursor [Tribolium castaneum]
Group
Gene Ontology      GO:0007165         2.4e-91  signal transduction
                   GO:0005102         2.4e-91  receptor binding
KEGG pathway
InterPro domain    [171-385]   IPR002181   2.4e-91   Fibrinogen, alpha/beta/gamma chain, C-terminal globular
                   [173-310]   IPR014716   5.1e-52   Fibrinogen, alpha/beta/gamma chain, C-terminal globular, subdomain 1
                   [311-375]   IPR014715   1e-22     Fibrinogen, alpha/beta/gamma chain, C-terminal globular, subdomain 2
Orthology group    MCL19140    Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206045-TA
ATGTCCAGCGAGAAATACTTAAAAATAATAGAGACGATTGAAGATAGAATGACTCATTTAGAGTCAATATTTCATGAGAGGTCCAATTCAATACTCAAATATCTGTTGGAAGTCTTAAGAGCCGTGAAGAACCCACCAGCCGAAGTGATGGAGAGAGCCTTCAAAAATCTCAAACACGACCTGGACAGGTTGAAGTATTCAGTGTCACAGAAAAGTGGGACTCCGCCTAAATTAAGAGTGGAGGACGGCAACTGCAACTTGGAGGGTGCTCTCGACAGTCGCCTCTCGTTTTTGGAAACGAACATGAAATCTGTGTTGAATGGTGTCGAAGCTATAACAGCGGTCATATCGGAAGTGAAAAACCGGCAGAAACTTAGGACATTTAACAAAAACGATCAAATCACTGGAAACTTAGACGCTACAACTTTAATCAGTGAATTCAGAAGAGTATTGATGGAACAAAAACCTAAGAAATGTGATTGCAAATTGGGTCGTGTCGATCGTTCAGAGAGGTATCCCACCGACTGTCAAGAGATCCAAGCCCAAGGTTTTAACGTCACTGGCATATACAAGATCAAGCCCGAAGACATGGAACCTTTCTATGTGCTGTGTGATCTTAACACTGTTGGTGGAGGGTGGACGGTCATACAAAATCGTTTCGACGGATCCCAAGATTTTTACAAAAATTGGAACGAATACAAACACGGTTTTGGTAACTTGGCCAGCGAGTTCTGGCTCGGTTTGGAGAAAGTGTATTATCTAACTAATCAGAAATTATATGAGCTGAGAGTGGAAATGGAGACACAAATCGGACAGGAGGCTTCGGCTACATTTTCCGTTTTCACTATCGGACCTGAATACGAGTCCTACAGGATAAGTACGCTAGGTACTTATCGAGGGAATGCCGGTGACTCGTTATCGTATCACGCCGGTCAAAAATTTTCAACCTATGAAATTGATAACGACGAATGGAAAGATGGTTCGTGTGCGGTTGAACATGGCGGTGCCTGGTGGTATAAGGAATGTGACAAAAGTAACTTGAACGGTAAATACATGAGTGGAACGGAGGAGAACAACGGTCAAGCAGTTTATTGGATCTCATTCAAAGGACCGAACTCGCCTCTATCGAAGACCAGGATGATGATAAGACCTCTGCCGGCCAGCCGACCACAGGAATACACCGAACAATTGCGGAAGCTATCTGAGAACCCAAAACCGGATGTAAAGCGAGTCCGCGGCAAGGAAATGAAGAGTGCTTACGACGGCGGTCGAGCCAGAGCGCCCTACAGATACGAGGATAGCGTGCGCCAGGAAGTCTTCTTCCCGAACTATACGTAA

Protein sequence:

>DPOGS206045-PA
MSSEKYLKIIETIEDRMTHLESIFHERSNSILKYLLEVLRAVKNPPAEVMERAFKNLKHDLDRLKYSVSQKSGTPPKLRVEDGNCNLEGALDSRLSFLETNMKSVLNGVEAITAVISEVKNRQKLRTFNKNDQITGNLDATTLISEFRRVLMEQKPKKCDCKLGRVDRSERYPTDCQEIQAQGFNVTGIYKIKPEDMEPFYVLCDLNTVGGGWTVIQNRFDGSQDFYKNWNEYKHGFGNLASEFWLGLEKVYYLTNQKLYELRVEMETQIGQEASATFSVFTIGPEYESYRISTLGTYRGNAGDSLSYHAGQKFSTYEIDNDEWKDGSCAVEHGGAWWYKECDKSNLNGKYMSGTEENNGQAVYWISFKGPNSPLSKTRMMIRPLPASRPQEYTEQLRKLSENPKPDVKRVRGKEMKSAYDGGRARAPYRYEDSVRQEVFFPNYT-
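
Consistency check (illustrative, not part of the original record):

The 1338 bp transcript and the 445 aa protein listed above agree if the transcript is a complete coding sequence: 1338 / 3 = 446 codons, that is 445 residues plus one stop codon. The sketch below shows one way to check this by translating the nucleotide sequence. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein sequence marks the stop codon. The names CODON_TABLE and translate are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is used here on the
    assumption that it matches the convention of the protein record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Excerpts copied from the start and end of DPOGS206045-TA.
first_30_bases = "ATGTCCAGCGAGAAATACTTAAAAATAATA"
last_21_bases = "TTCTTCCCGAACTATACGTAA"

print(translate(first_30_bases))  # compare with the start of DPOGS206045-PA
print(translate(last_21_bases))   # compare with the end of DPOGS206045-PA
print(1338 // 3 - 1)              # codons minus the stop codon
```

Passing the full DPOGS206045-TA sequence to translate and comparing the result with DPOGS206045-PA extends the same check to the whole record.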