Monarch geneset OGS2.0

DPOGS203716
TranscriptDPOGS203716-TA5760 bp
ProteinDPOGS203716-PA1919 aa
Genomic positionDPSCF300010 - 1287001-1315813
RNAseq coverage144x (Rank: top 54%)
Annotation
HeliconiusHMEL0059550.083.47% 
BombyxBGIBMGA003503-TA0.075.72% 
Drosophilatrol-PL6e-5024.80% 
EBI UniRef50UniRef50_D6W6H40.045.36%Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6W6H4_TRICA
NCBI RefSeqXP_001811978.10.044.16%PREDICTED: similar to agrin [Tribolium castaneum]
NCBI nr blastpgi|2700146630.045.36%hypothetical protein TcasGA2_TC004709 [Tribolium castaneum]
NCBI nr blastxgi|1892336170.044.09%PREDICTED: similar to agrin [Tribolium castaneum]
Group
Gene OntologyGO:00055151.4e-16protein binding
KEGG pathwaytca:1001417630.0 
 K06254 (AGRN)maps-> ECM-receptor interaction
InterPro domain[1497-1661] IPR0089854.2e-50Concanavalin A-like lectin/glucanase
[1196-1403] IPR0133205.8e-49Concanavalin A-like lectin/glucanase, subgroup
[1513-1649] IPR0017917.7e-36Laminin G domain
[1521-1649] IPR0126804.6e-27Laminin G, subdomain 2
[149-194] IPR0023501.4e-16Proteinase inhibitor I1, Kazal
[860-909] IPR0020496.4e-13EGF-like, laminin
[443-486] IPR0114971.4e-09Protease inhibitor, Kazal-type
[1685-1717] IPR0062099.3e-08EGF
[1684-1719] IPR0062101.3e-06Epidermal growth factor-like
Orthology groupMCL11844 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203716-TA
ATGGTAGCATCGTATGGAGTATACGGTAATTCACAAGATGAGGAGAGCTCGGGATTTGCTCATCTTTCTGAAAAAAATATATCATATGGTCAGAGAAATGAAACTCCGGTTCAGGGTGTGATAGTTAATCAACAAGGAAATAGCGCAGTGGTACGGTCTACATCCGAACCTGATTCTTCGTGTAGCAAATGTTGTCCGTGGTCATCTAATGCCCCAGCAGCTAACTCAGAGGATTACTGTTGGATACACTACTGCCGACCAGCACTTATAGTTCTGTTGTTAATATTTTTTCTAATACTACTTGGCGTGTCAACAGGGTTCTTATTGACAAATAATTATTTACATTATCAACCTCGTCCACCTGCTCCTGAAGAAGCTTGTGAGAAGACATTTTGTCGATGGGGCGCAGAGTGTGTGTCATTAGGCGACGGACGCGCCCATTGCGCTTGTCCCACTTCCTGTCCGTCCTCTGCGTCACCCGTTTGTTCCACCGCTGGAAGAACTTACCGTAACCATTGTTTTTTGCGTAAAGAAGCCTGCGAGCGAAGGCTCAATTTACGCGTCAAGCATGAGGGTGAATGTGAAGCGGGAGATCCTTGTTCCGGACTAACCTGTCCCACTGGTGCGAGATGCGTAGTTACCTATGGCAACGCGGAATGTCGGTGTCCCCGTAATTGTCAACGCCGAAAAACCGTCTGCGGCAGCGATGGCCGGGAGTATCCATCTACTTGCCATCTGGATAAGCACGCTTGTGACAATCAGATAAACATCACTATCAAGTATCACGGCAAATGTGATCCTTGCCTGGAACATGAATGCGTTGATGGAGGAATATGTCAACTGAATGAAGTTCGTGCACCAGTATGTAGGTGTGGTCCACCTTGTAACTTAATCGTTCGACCAGGCTCGGCCGTTTGCGGTTCAGATTTCAAGGACTATGCTAGCGAGTGTTCGCTTCGCAGAGAATCATGCAGAACGAGACAGCAGCTTTCGATAGCTTATAGAGGAGAGTGTGCCTCAGCTCAACATCCCTGCGAATCTGTGAAGTGTGGCATCCGAGAGCGTTGCATATTGGATGCACGAGGTGTGGCTGTATGTGGCTGTGGCCCAGAATGTGAAGACGTTCTGCGACCAGTCTGTGGAAGCGACGATCGCACATATGCAAGCCCTTGTTTATTAGAGCGAACAGCATGTTTGGAAAACCGAGATGTGCGGGTCGCTTATATGGGAGCTTGTGGTCTGGAAAATCCATGCGCACGTGCAACTTGTCCTTGGGGTGGAGCATGTGTGACTCGCAGCGGCGCTGCTCACTGTTCGTGTCCAGTATGCGATGCAACTCTCTCTCCGGTTTGCGCATCTGACCACAACACTTACGGGAGCGAGTGCAAAATGCGAATGCACGCGTGTCAGGAGAGTCTTAAGGAGGGAGAGCTTCGGGTGCTCTATAACGGCACTTGTCAAACATGTGCTGATGTTATGTGCATGGGCGACGGGACTTGTGAAATGGACGAAACAGGCCGTCCAATCTGCCACTGCAACCACAATTGCACAGCACAGGAATCAGACGTCGTATGCGGTACTGACGGCCAGACTTATCAGTCACAATGTGAGCTCGATTTGACAGCGTGTCGCGACCAGAGCGAGCTGCGGTCGGCCTACAGTGGAGATTGTGCTCTCTGCAATGGCGTTCAGTGCTCCTATGGTGCTCATTGTGTAGCTGGAGAGTGTATTTGCCCTACGGATTGTAGCGGTGCTCCTCGAGAACCTGTATGTGGAAGCACTATGCAAACATATCAAAATGAATGTGAACTTCAGAAAGCAGCATGTAACCTTCCTCCTCCAACCAAACTTCATGTGATTTTTTATGGAGATTGCAAGGATAGGCTGGCAGTGGTTCCACCAATAGCTATGAGACCAACCCTCAATCAATTTACTAACTTTCGCTTTACCGCTTTACTAACTCAAACATTTGACATGGTGTGCGTAGCTACGACTGCGATGATAACAACTGAAGCCGATGAAAGTTCAACTGATATTGTGGAAGTGACACAGAAAGTAGATACGACAAGTCCTTCAGCTTGCCGCGATATCCGCTGTGACTTTGACGCCAGTTGTGAAATTGGTTACGACGGATATCCGCGTTGTTCTTGCTTATTTGAATGTCCAGCCGACGATGAATATTTTCCGGTTTGCGCTTCTGACTTTCGACTTTACCCAAGCTTATGCGCTATGCGGAAGGAAGGCTGTCAAAAACAGTTGGAACTTAGATTAAGACCTCTGGATTTATGTAAAGGTATGGAAGTTAGACCTTGCGGAAATAATAGAGCTATAATAGACAAATCGTCAGGTCTTGAAATAGATTGCGGCAACGGACCCCATCGTCAAGACTGTCCTGCTGGAAGTTACTGCCATATCACTTTGACCGCCGCAAAGTGCTGCCCTAAAAACGACACAAAACAAGTAGATGAAAGGAAGACGACTACCCACTGTTCGGAGAGCGCCTACGGTTGTTGTTCAGATGGCTCGACCGCAGCAAGTGGCCCGGGTGAAGAGGGTTGTCCAATTACGACTTCAACTTGTGGCTGCAACCGCCTAGGGTCAATATCTGATCGGTGTGATGATAGCGGCCAGTGTGTATGTCGCCCCGGTGTAGGGGGCCTTAAGTGTGATAGATGTGAGCCTGGATACTGGGGTCTGCCTCGCATAGGCTCAGGACATACTGGATGTATTCCATGCGGTTGTTCTGCATTTGGTTCTGTAAGAGAGGATTGTGAGCAAATGACCGGTCGATGCGTGTGCCGTACTGGGGTTCAGGGTCAGAAGTGTACCGTGTGCGCAGACCATCGCCGTCGCTTAGGACCTAACGGCTGTTCCGATCCGGAAAACGGCAGTTCGGTGGAGTCATGTGCGGATTTGTCATGTTATTTCGGAGCAGTGTGCACAGAACGTACGGGTGGAGCGTTGTGTGAGTGTGCTGCTGCCAACTGTCCTGACAGTGATCTTAACATGATGGTTTGCGGCAGTGACGGTAAAACTTACGAATCAGAATGCCACCTGAAATTGCAAGCCTGTCGCACTCAAGAGGATATTGTCGTTCAAGCATTTGGTCCTTGCAAGTTGTCCGAAGCATCGGGAACTGCGGGGCCACCACGACCGTCTTCACCGATACAGTTCACTCAACAAGATGATGGAGCAGCTTCGAAGTCTACAAGGCATCTGCTTAACCCTGACAAATATTACAATAAATATGATTGGACAAGGAAAGAAACTCCAAGTGATTTTGAAAACATTGTATCGGGCCAGAAAGTAAAAGGTTCGCAAACAGCGACAACAGCAACAGTGGGTGCGGTGGGAGCATTGCTTGGGGACTTATGTGCTGAAGACGCAGACTGTGCAGCTTTGCCGGGTGCTCTCTGTACACGTGGTGGCTGTGTGTGTCGTCCGGGTTACACACCCACTGCGCATAGGAAGGCTTGTATTGAAGAATTTCCTCAAGAAACCACAGAAGAATACAGTGCATGTTTGTCAGATCCTTGTTATAATTTTGGAACTTGCATTGACCTGCCAGGTTCCACTTATACTTGTGTCTGCTCCGAATCCTATACCGGCTCGAACTGTGAATCACTTATCAAAGACGGTCCACCGATTACGTACATCGAAACTCCATCATTTGTCGGTTCGTCTTACATACGCTTAAGACCACTGAAGGCTTACCATAAGTTGAACATCGATATCGAATTTAAGGCATTCTCAGAAAATGGGGTCTTATTGTACAATCAGCAGAAACTCGACGGAACGGGTGATTTTGTTTCGCTAGCGTTGGTAAACGGATATTTGGAGTTTAGGTACAATCTTGGGAATGGAGTAATAATTTTAACATCTTTAGAAAAGATATCATTGAACGAATATCATAAAGTGTCGGCTAAAAGATATCACCGCGATGGTATTCTGACGGTAGATGATATGGAAGATGTAGCAGGACAGTCAGATGGTAATTTAAAAGCTTTAGATCTTGCGGACGATGCCTTCATTGGTAGTGTACCAAGTAATTACACGAGGGTATTCGAAAATATTGGAACCCGAAACGGTTTTATTGGTTGTATTAAATATTTGAGAATTATTCGGCACCAAATAACGAAGAAATTGGGTCGTCCAGACTCATTAGTTGTCGCCATGGAAAACGTTCGAGAGTGTCAATCTAATCCTTGTATGAGTATGCCGTGTAGAAATGGGGCCACGTGTCAGGCTGTTGAAGGTTCGGTGACTGAATATACATGTAGCTGTCCTTTCGGATTTCAAGGAGCCAATTGTAACGAGAGAATAGATCCATGCGAATCCAATCCCTGTGGATATGATGAGGGGTTATTGTGTGATATTGGTCCTGACGGCGGACATATTTGTCGGTGTTTGTTTGGGGGAAATATCGAATCCGATGGAAATAATTGCAATAAAGATGTTAATGTTATCCATGAAACTTGGTCACCTCAATTCAATGGTACTAGCTATATCGAGTTACCGCCGCTCGAGGGCTTGGGAAAAGCATTCCGTATCGAAATTTGGTTTTTAACGAACCGTTTTTCCGGAATGCTTCTTTATACCGGGCAGTCAAATAAAGCCAAGGGGGATTTTATAGCGATTAACTTGGTTAATGGATATCTACAGTTTAGGTATAATTTAGGAAGTGGAATTGCAAACATCACTTCCCCAACACCGATAACTAAGGGGCAATGGCATCGCGTTCGTGTAAGCCGAGTTGGCAGACATGGTAGTCTACAGCTTGATCAGTTGCCTGTACAGCGCGGCCTTTCCCCACCACCCCTCACTCACTTGGAGCTTAATCTCCCTCTATTTATTGGTTCTTTGCCTGCTTACGTCCGTCCTCACAAAATGTCCGGGGTGACCAGCAGTTTTATCGGGGTTATGCAACAGGTATTCGTAAACGGCAATCCACTATCGCTATACAGTGAGGATACAGCAAAATGCTTTGTAGTTGCTGAGGAAGAGCGACTTCCGTGTGCTACGAGCGGCGTCACTAAATACACCGGACCGCCGTGTGGTGATGATCTAACCCCTTGTAAAAATAATGGTTCGTGCGTACCGTTATTGAACGAATATAAATGTATATGCCCAGACGGATATCAAGGACGGAACTGTGAGCTCCAATTAAAAGTAGAGATGTTAAACGATGGAGCGCCAATTAAGTTCGACGGAAATAATTACTACTCCTACAGAAGTCGTGGCGGCCGTAGGAACCGTGGATTTCGTGGTATTAGATATGAAATAAAGTTTCGGACTTATAATAACTCCGGTCTTTTAATGTGGAGACGAAAAATTGGTATACGACCCCGGGACTTCATCGGACTCGGATTAAGTAATGGAAAATTACAATTAATATACACTGACACAGATGTAAAAGAGAACAGTTTGGCTTTGAACGAGGAGTGGTTTCAAAGTGTTGAATCGAAGGAGAGAGTAGATGATGGACGTTGGCATACAGCGACTGTTAGAAGAAGGAAGCGGCTCGCAATGTTGCAAGTAGACGATACACCGCCTGTGAGGGGGTACTCGCAATCATTGCTGGTACCTTCGAAAGCTAATCCAAAGTTATGGATAGGAGGATCTCCATCGCTTCCTTTAGGATTGCCAGGGGACCTTTACTCAGGATTCCGAGGTTGTATCGCCAGCGTGAAGTCTAACGGTAGGCACATCGACATTACAACACCTATACGACCGACGACTACAATACGATATTGTGATTAA

Protein sequence:

>DPOGS203716-PA
MVASYGVYGNSQDEESSGFAHLSEKNISYGQRNETPVQGVIVNQQGNSAVVRSTSEPDSSCSKCCPWSSNAPAANSEDYCWIHYCRPALIVLLLIFFLILLGVSTGFLLTNNYLHYQPRPPAPEEACEKTFCRWGAECVSLGDGRAHCACPTSCPSSASPVCSTAGRTYRNHCFLRKEACERRLNLRVKHEGECEAGDPCSGLTCPTGARCVVTYGNAECRCPRNCQRRKTVCGSDGREYPSTCHLDKHACDNQINITIKYHGKCDPCLEHECVDGGICQLNEVRAPVCRCGPPCNLIVRPGSAVCGSDFKDYASECSLRRESCRTRQQLSIAYRGECASAQHPCESVKCGIRERCILDARGVAVCGCGPECEDVLRPVCGSDDRTYASPCLLERTACLENRDVRVAYMGACGLENPCARATCPWGGACVTRSGAAHCSCPVCDATLSPVCASDHNTYGSECKMRMHACQESLKEGELRVLYNGTCQTCADVMCMGDGTCEMDETGRPICHCNHNCTAQESDVVCGTDGQTYQSQCELDLTACRDQSELRSAYSGDCALCNGVQCSYGAHCVAGECICPTDCSGAPREPVCGSTMQTYQNECELQKAACNLPPPTKLHVIFYGDCKDRLAVVPPIAMRPTLNQFTNFRFTALLTQTFDMVCVATTAMITTEADESSTDIVEVTQKVDTTSPSACRDIRCDFDASCEIGYDGYPRCSCLFECPADDEYFPVCASDFRLYPSLCAMRKEGCQKQLELRLRPLDLCKGMEVRPCGNNRAIIDKSSGLEIDCGNGPHRQDCPAGSYCHITLTAAKCCPKNDTKQVDERKTTTHCSESAYGCCSDGSTAASGPGEEGCPITTSTCGCNRLGSISDRCDDSGQCVCRPGVGGLKCDRCEPGYWGLPRIGSGHTGCIPCGCSAFGSVREDCEQMTGRCVCRTGVQGQKCTVCADHRRRLGPNGCSDPENGSSVESCADLSCYFGAVCTERTGGALCECAAANCPDSDLNMMVCGSDGKTYESECHLKLQACRTQEDIVVQAFGPCKLSEASGTAGPPRPSSPIQFTQQDDGAASKSTRHLLNPDKYYNKYDWTRKETPSDFENIVSGQKVKGSQTATTATVGAVGALLGDLCAEDADCAALPGALCTRGGCVCRPGYTPTAHRKACIEEFPQETTEEYSACLSDPCYNFGTCIDLPGSTYTCVCSESYTGSNCESLIKDGPPITYIETPSFVGSSYIRLRPLKAYHKLNIDIEFKAFSENGVLLYNQQKLDGTGDFVSLALVNGYLEFRYNLGNGVIILTSLEKISLNEYHKVSAKRYHRDGILTVDDMEDVAGQSDGNLKALDLADDAFIGSVPSNYTRVFENIGTRNGFIGCIKYLRIIRHQITKKLGRPDSLVVAMENVRECQSNPCMSMPCRNGATCQAVEGSVTEYTCSCPFGFQGANCNERIDPCESNPCGYDEGLLCDIGPDGGHICRCLFGGNIESDGNNCNKDVNVIHETWSPQFNGTSYIELPPLEGLGKAFRIEIWFLTNRFSGMLLYTGQSNKAKGDFIAINLVNGYLQFRYNLGSGIANITSPTPITKGQWHRVRVSRVGRHGSLQLDQLPVQRGLSPPPLTHLELNLPLFIGSLPAYVRPHKMSGVTSSFIGVMQQVFVNGNPLSLYSEDTAKCFVVAEEERLPCATSGVTKYTGPPCGDDLTPCKNNGSCVPLLNEYKCICPDGYQGRNCELQLKVEMLNDGAPIKFDGNNYYSYRSRGGRRNRGFRGIRYEIKFRTYNNSGLLMWRRKIGIRPRDFIGLGLSNGKLQLIYTDTDVKENSLALNEEWFQSVESKERVDDGRWHTATVRRRKRLAMLQVDDTPPVRGYSQSLLVPSKANPKLWIGGSPSLPLGLPGDLYSGFRGCIASVKSNGRHIDITTPIRPTTTIRYCD-