Monarch geneset OGS2.0

DPOGS214907
TranscriptDPOGS214907-TA2307 bp
ProteinDPOGS214907-PA768 aa
Genomic positionDPSCF300135 + 376346-403965
RNAseq coverage116x (Rank: top 58%)
Annotation
HeliconiusHMEL0171854e-14945.65% 
BombyxBGIBMGA003026-TA4e-23100.00% 
DrosophilaCG15765-PA2e-6446.98% 
EBI UniRef50UniRef50_D6WLC41e-8362.14%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WLC4_TRICA
NCBI RefSeqXP_309125.31e-6655.50%C-type lectin (AGAP000940-PA) [Anopheles gambiae str. PEST]
NCBI nr blastpgi|2700082244e-8362.14%hypothetical protein TcasGA2_TC014328 [Tribolium castaneum]
NCBI nr blastxgi|2700082243e-7829.56%hypothetical protein TcasGA2_TC014328 [Tribolium castaneum]
Group
Gene OntologyGO:00054883.8e-20binding
KEGG pathway 
InterPro domain[26-142] IPR0161873.8e-20C-type lectin fold
[27-144] IPR0161861.7e-17C-type lectin-like
[34-142] IPR0013043.8e-12C-type lectin
Orthology groupMCL35039 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214907-TA
ATGGTCGACACGTCAAGAATATATGAAGCCGTGAACATCAGCAACACATGGGTTCTGCCGGAGGAGGGTTTCCCAGTGTTCTACCGTTACTTCAGGGACCGCATCTCCTGGTACGAGGCGGACGCCGTCTGCCAGTTCCATCACGCCAATCTCGTCACTGTGGATACGACTGCTCAATACGATGCTGTCAGGGCATATCTTAAAGAGCTTGATATATCCAGTGCTGTATGGGTTGGACTCATCCGAAGCAACCCAGATGGAGATTTTACCTGGACGGACTACCGAGGTTTAAGTGGTGATGGATATTGGAGTGCTGCACCTGATACTCGTTCTGCTCCTTTGTGTGCTGCTGCCGATCCTGCTGCTGACTATCGTTGGGAAGCTCGAGCTTGTGGTGGACCCTCCGTTGCATCTTTCATTTGTGAACTACCTGTACCGCAATGGGCACTTGGTAATGAAGGCTGCATGGTACGGGCATTGCCAGCACTTACGATTCTTTATTTACCTGAGAGCGCCGCGGTACAACTAACTGCGGATTGCGGTTTGGCTGGAGTTAAACGGGTTCAATGCACTGGAAACGTGAAACGGGAGGACTTGCTTAAAGAGCTAGCATGTACTGAGGAAGATCAAACAACTTTAACTTCAATTGGAACGTCTCCAATTACATCGTGGCAAATGACCACAGATTCGATATTTAACAATCAATATTTTAATAATGAAGGTACTGGCACAGAAGAAAATGTTACCACAGAACACGAGGATGATATAAATCAATCAAGTCCTATTAAAGTTATATCTACCTCGTTTACCATCATTCCTGATACTTCTAATAAAGTTTCCTTAAAACCTTCTGTCACAGAACAATTGTTCAAAACACCTTTGCAGAAACATATTGATATTATAAATAATAATTATATTCAACAAAATGAAACACCGAAATTAGTTCACGATGATGACTTGTTCACTAATGAACAAAATAATCAACAGGGAAATGATAAAATGTTACAATATACAAAATTGCATGATGAACTTGCACGCCTAGGTAATTCAGATACTGTATTTAGTCAACCCACTGACCATTTTGTTCCTCCTTTGGTTATGGCTAAGGCAAAAATAAGCGATGATTTGACAGCATTATCTATTAAGGAAAAACTAGCTCAACAACTTTCTGAACAACAATTGAAGCATGAGATTCAACAAGAGTTTACGGCTACTGCTAATAGTTTTAACAAAGAGCTATATAGAACTGAAGCTATTAACAATCCATCAACTACATCTCTTCCTTTAATAAACATCAGTACCTCTTCTAAGGTAAATAAAAATAAAGACAATGAAAATGAGAATATTTCAATCAAAAAGACCAAAAACAAATACATGGAAACTAAACAAAAAAGTTTTAAAAATGGAGGAACAAAAGCTTCAAACACTATTTCGTCTGTCGAAACAACTCCCATCTTATCACAATTGAATACACAAAATAATATAATTACTAGCACCGTGGGTTCTATCACAGAACATACGCAAGGATATCCAACCAAACCACAATCGATAGAAGAAACATCGCTGGAACTTACAGTAATTTTAAGGGAACCAAATGAAAATACAAGCTATAACGATACAGTTGTTTCTACGAATAGACCACTTATTTCAAATAAACCACTAAAACAAAATAATATGGCAGATAATGAAACAGAAATACCCACCACGTCATCACATGTATTTCCAACGTCTACACAAGAGAATAACCTGAATGAAGATAAATTTAGAGATATGAATGTTTCTAATGATAAACATAGAGATACGGACAATTTTTTTTATGTGACAACTAAAAGTCTTCCTTCCAATGATAATGTCCAAACCAAAGCTAATGACGAAAGTAAAGACTTAGCGATGACAAACTCTCTGTCAGTATTATCTCTTATTAAAAAGTATAACAATAGTGCTACTAATATAAACTTGACGTCAGAAAACAGTACTATAGATGCATCTGTCACAATCGAAGTAATTGAGGATGCAAAAGCTACAAATAAAACTGTTGAACCGGAGAGTGAATTGAAAATTAACGATACAAGAAATAATTTTTTTAACTTTTCCGAAAACGAGCCTTTTAACGTAACAGTAAAAGAAGGATTTGACAGCATCATAGATTCCAATTTTGATTTAGACAGGAACATGACTTCAGATACAGATGATTTCCAATCTCCTCTCTTGTCTGCTGCTAGTGAACCGCTTCCAAAACCAAATAGATCCAGGCGAATTCAACAACAAACACGAAATAAGTTTAATCCGTTTCGTATTTTAGGCTAA

Protein sequence:

>DPOGS214907-PA
MVDTSRIYEAVNISNTWVLPEEGFPVFYRYFRDRISWYEADAVCQFHHANLVTVDTTAQYDAVRAYLKELDISSAVWVGLIRSNPDGDFTWTDYRGLSGDGYWSAAPDTRSAPLCAAADPAADYRWEARACGGPSVASFICELPVPQWALGNEGCMVRALPALTILYLPESAAVQLTADCGLAGVKRVQCTGNVKREDLLKELACTEEDQTTLTSIGTSPITSWQMTTDSIFNNQYFNNEGTGTEENVTTEHEDDINQSSPIKVISTSFTIIPDTSNKVSLKPSVTEQLFKTPLQKHIDIINNNYIQQNETPKLVHDDDLFTNEQNNQQGNDKMLQYTKLHDELARLGNSDTVFSQPTDHFVPPLVMAKAKISDDLTALSIKEKLAQQLSEQQLKHEIQQEFTATANSFNKELYRTEAINNPSTTSLPLINISTSSKVNKNKDNENENISIKKTKNKYMETKQKSFKNGGTKASNTISSVETTPILSQLNTQNNIITSTVGSITEHTQGYPTKPQSIEETSLELTVILREPNENTSYNDTVVSTNRPLISNKPLKQNNMADNETEIPTTSSHVFPTSTQENNLNEDKFRDMNVSNDKHRDTDNFFYVTTKSLPSNDNVQTKANDESKDLAMTNSLSVLSLIKKYNNSATNINLTSENSTIDASVTIEVIEDAKATNKTVEPESELKINDTRNNFFNFSENEPFNVTVKEGFDSIIDSNFDLDRNMTSDTDDFQSPLLSAASEPLPKPNRSRRIQQQTRNKFNPFRILG-