Monarch geneset OGS2.0

DPOGS211056
TranscriptDPOGS211056-TA6354 bp
ProteinDPOGS211056-PA2117 aa
Genomic positionDPSCF300446 + 60609-114670
RNAseq coverage77x (Rank: top 65%)
Annotation
HeliconiusHMEL0077940.091.58% 
BombyxBGIBMGA009611-TA0.075.21% 
Drosophilasli-PC0.050.96% 
EBI UniRef50UniRef50_Q7QCT20.054.40%AGAP002793-PA n=9 Tax=Pancrustacea RepID=Q7QCT2_ANOGA
NCBI RefSeqXP_972265.10.053.12%PREDICTED: similar to AGAP002793-PA [Tribolium castaneum]
NCBI nr blastpgi|910780860.053.12%PREDICTED: similar to AGAP002793-PA [Tribolium castaneum]
NCBI nr blastxgi|910780860.053.12%PREDICTED: similar to AGAP002793-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055096.8e-10calcium ion binding
GO:00055157.2e-07protein binding
KEGG pathwaydme:Dmel_CG83550.0 
 K06850 (SLIT3)maps-> Axon guidance
InterPro domain[1729-1906] IPR0089851.4e-42Concanavalin A-like lectin/glucanase
[1929-1939] IPR0133201.2e-41Concanavalin A-like lectin/glucanase, subgroup
[1753-1889] IPR0017912.6e-31Laminin G domain
[1761-1891] IPR0126794.7e-28Laminin G, subdomain 1
[1419-1470] IPR0004833.3e-11Cysteine-rich flanking region, C-terminal domain
[2029-2109] IPR0062072.7e-10Cystine knot, C-terminal
[1557-1593] IPR0018816.8e-10EGF-like calcium-binding
[1599-1631] IPR0062097.2e-07EGF
[1560-1593] IPR0062101.2e-06Epidermal growth factor-like
[694-726] IPR0003722.5e-06Leucine-rich repeat-containing N-terminal
Orthology groupMCL10589 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211056-TA
ATGGCGGCTGTAATTAACCGGCATTGGGGTTATATAATATGCCGTAAGATGTTGTTGATAGTGCTGTCGTGCGTTCTGGCCGTGGGCGCCGCCTGTCCGTGGGCGTGCTCCTGCCGGCCAGGTGCCGCTGACTGCGCCCATCGAGCCCTGCTCCACGCACCTCGAAGACTGCCAGTAGATGCCCATAGGCTAGATCTTCAAGGCAACAATATAAGCATCATCTTCCAGAGCGACTTCCAGAACCTGAAGGAACTTAAGATTTTACAATTATCAGAGAACCAAATTCACACTATAGAGAGAGATGCGTTCTTGGAGCTGAACGTACTGGAGCGACTGAAACTGAGCAATAATCGACTCGGCCACATACCTGATGGTATTTTTCTGAGATTGAGGCATCTACAACGTTTGGACTTAAGTCGCAACGAACTGACCGCTATCAGCAGACGGACCTTCCGAGGTCTGACCGCGCTGAAAAGTCTACACTTGGATGGAAACCAGCTCAAGTGCATTGATGAAAAGGCGCTGGAACATTTGAAAAGCTTGGAAGTCTTAACCCTGAATAATAATAACCTGACGTACCTATCACTGGAGGCGGTGTCTGTCGCTCGTCTCCACACCCTGCGACTGTCGGACAATCCGATCGTGTGCGACTGTCGTGTCGCACGTCTGGCTGCGGCCGTACGCGCCGCTGGAATACTCGGACTGGGAGCGAGATGTCAGGCTCCAGCAACCTTGAGAGGAGCCATGCTGACGGAGTTGGAAGCCCAAGATTTAATATGCAACGGACCTAACTCCATAGCGGAGTGTTCGTCAGAGCCGCGCTGTCCGCCCGCGTGCCGTTGTTCCACCGACGGCACCGTCGACTGCCGAGAGAAACTCCTCACAGAGCTGCCCACCACCATACCGCACAGAGCCACTGAGATCCGTTTAGAACAGAACGAGATAACTGAAGTAGGCGCTGGCGCCTTCTCGGCTGTGAAGAGAGTCGCTCGCATCGACCTGTCCAACAACAAGATCGCCAAGATGGCCGGCGACGCTTTCAACGGCCTAACACACCTGACTTCATTAGTTCTCTATGGGAACAAGATAAAAGACCTGCCATCAGGGATCTTCCACGGGCTGACATCGTTACAACTGCTTTTGCTCAATTCAAACGAGATAAGTTGCGTCCGTAAAGACACGTTCAGGGACCTGCAGAGTCTAAAACTACTATCTCTCTATGACAACAACATCAGGTCGCTTCCGAACGGAACCTTCGATTCACTCACCGGGATACAAACTTTGCACTTGGGTCGTAATCCGTTTTCCTGCGACTGCTCGCTGCGCTGGCTGGGCGCCTACCTCCGCCGGAACCCCATCGAGACCTCGGGCGCCAAATGCGATTCCCCCAAGAGGATGAACAGGAAACGAATCGATGCTCTGAGGGACGAGAACTTCAAATGCAAACCCGGTGAGGAGCCTCCGGACGCGTGCGGCGACGCCCCGCCCTGCCCCGACACGTGCGCCTGCTCGGGCGCCGGCCGCGCGCTGCGGGTGGCGTGCGCCCGCGCAGGACTCGCCGACGTGCCCAGAGACCTGCCGCTCACAACACACGCACTGATCATGCCGGACAACAATCTCGGTCAAATTAAATCTGACGGACTATTCGGGAGACTGCCGGACCTCGCGAAGCTGGACCTGAGGAACAATGGTATAACAGTGATAGAGGACAACGCGTTCGACGGCGCGGTGGCCATGAGGGAGCTCTCGCTGGATGGGAACCTGCTGCAGACTGTGGGCGACAAAATGTTCTTCGGACTGCACAGCCTTACTCTGCTGTCCCTGACTGATAACAAAATAAGGTGCATCACCCCCGGCTCCTTCGACCACCTGACGATGCTGTCGACTCTTTCGTTGGCCAACAACCCTATCGCGTGTAACTGTCACATGTCCTGGTTGCCGGGCTGGTTGCGAGGGCGGCGACTCTCCTCGGGGGTGACGTGCGCCCTGCCCCTCGGCCTGCGCGGCACTGAGCTCCAACAGCTGGAAGTGGTCGACTTCAAATGTGCCCCGGACGAGCAGGGCTGCCTCCCGGCGGACTACTGCCCCGAGCGCTGCGCCTGCGCCGGGACCGTGGTGAGATGTGCTCGAGCCAGGCTGACCTCGCTTCCGCCGAGAATACCGCCCTACACCACGGAACTGGACTTAAGTCGCAACGAACTGACCGCTATCAGCAGACGGACCTTCCGAGGTCTGACCGCGCTGAAAAGTCTACACTTGGATGGAAACCAGCTCAAGTGCATTGATGAAAAGGCGCTGGAACATTTGAAAAGCTTGGAAGTCTTAACCCTGAATAATAATAACCTGACGTACCTATCACTGGAGGCGGTGTCTGTCGCTCGTCTCCACACCCTGCGACTGTCGGACAATCCGATCGTGTGCGACTGTCGTGTCGCACGTCTGGCTGCGGCCGTACGCGCCGCTGGAATACTCGGACTGGGAGCGAGATGTCAGGCTCCAGCAACCTTGAGAGGAGCCATGCTGACGGAGTTGGAAGCCCAAGATTTAATATGCAACGGACCTAACTCCATAGCGGAGTGTTCGTCAGAGCCGCGCTGCCCGCCCGCGTGCCGTTGTTCCACCGACGGCACCGTCGACTGCCGAGAGAAACTCCTCACAGAGCTGCCCACCACCATACCGCACAGAGCCACTGAGATCCGTTTAGAACAGAACGAGATAACTGAAGTAGGCGCTGGCGCCTTCTCGGCTGTGAAGAGAGTCGCTCGCATCGACCTGTCCAACAACAAGATCGCCAAGATGGCCGGCGACGCTTTCAACGGCCTAACACACCTGACTTCATTAGTTCTCTATGGGAACAAGATAAAAGACCTGCCATCAGGGATCTTCCACGGGCTGACATCGTTACAACTGCTTTTGCTCAATTCAAACGAGATAAGTTGCGTCCGTAAAGACACGTTCAGGGACCTGCAGAGTCTAAAACTACTATCTCTCTATGACAACAACATCAGGTCGCTTCCGAACGGAACCTTCGATTCACTCACCGGGATACAAACTTTGCACTTGGGTCGTAATCCGTTTTCCTGCGACTGCTCGCTGCGCTGGCTGGGCGCCTACCTCCGCCGGAACCCCATCGAGACCTCGGGCGCCAAATGCGATTCCCCCAAGAGGATGAACAGGAAACGAATCGACGCTCTGAGGGACGAGAACTTCAAATGTAAACCCGGTGAGGAGCCTCCGGACGCGTGCGGCGACGCCCCGCCCTGCCCCGACACGTGCGCCTGCTCGGGCGCCGGCCGCGCGCTGCGGGTGGCGTGCGCCCGCGCAGGACTCGCCGACGTGCCCAGAGACCTGCCGCTCACAACACACGCACTGATCATGCCGGACAACAATCTCGGTCAAATTAAATCTGACGGACTATTCGGGAGACTGCCGGACCTCGCGAAGCTGGACCTGAGGAACAATGGTATAACAGTGATAGAGGACAACGCGTTCGACGGCGCGGTGGCCATGAGGGAGCTCTCGCTGGATGGGAACCTGCTGCAGACTGTGGGCGACAAAATGTTCTTCGGACTGCACAGCCTTACTCTGCTGTCCCTGACTGATAACAAAATAAGGTGCATCACCCCCGGCTCCTTCGACCACCTGACGATGCTGTCGACTCTTTCGTTGGCCAACAACCCTATCGCGTGTAACTGTCACATGTCCTGGTTGCCGGGCTGGTTGCGAGGGCGGCGACTCTCCTCGGGGGTGACGTGCGCCCTGCCCCTCGGCCTGCGCGGCACTGAGCTCCAACAGCTGGAAGTGGTCGACTTCAAATGTGCCCCGGACGAGCAGGGCTGCCTCCCGGCGGACTACTGTCCCGAGCGCTGCGCTTGCGCCGGGACCGTGGTGAGATGTGCTCGAGCCCGACTGACCTCGCTTCCGCCGAGAATACCGCCCTACACCACGGAACTGTACTTGGAGTCCAACGAGATCACCAGCATCTCCTCGGAGCAGGTCCGTCACTTGACGCAGCTGACGAGGCTGGACCTCTCCAACAACAGGATCGCAGTGCTCTCCAACAACACCTTCGAAGGTCTCAGCAAGCTCTCCACGCTCATCGTCAGTTACAACAGGCTGAGATGCGTTCAGCGGGACGCGCTCAAGGGTCTGACGCAGCTCCGCGTGCTGTCTCTCCACGGCAACAACATCTCCACTCTGGCGGACGGAGTCTTCAGAGACCTGGAATCCATCTCACACGTTGCCCTGGGGTCCAATCCCTTGTACTGCGACTGCAGCGCGCGCTGGCTGTCCGAGTGGGTCAAAGTGTCCGGGGAGTATGTGGAGGCGGGCATCGCTCGCTGTGTGGCCCCACCGCCCATGAGGGACAAACTGTTGCTCAGCACAGCTACGAGCGCTTTCGTTTGTAACGGTAACCCCCCACCGGAAGTCGTGTCCAAATGCGACCGCTGCTACCGGAACCCGTGTCTCAACCAAGGCACGTGTCGCTCCACCACATCCGGAGGCTTCGCCTGTTCCTGTGCCCGAGGCTTCCACGGAGAAACTTGTCAGTATGAGATAGACGCGTGCTACGGCTCTCCCTGCGCCCAGGGAACCTGCCAGCTACTAGAAGAGGGGAGGTTCCATTGTGCGTGTCATGCTGGATACACAGGCGTGAGGTGTGAGGTGGACATTGACGATTGTGTCGGCCACCGCTGCAAGAACAACGCGACCTGTGTGGACCACCTGGAGGGCTACACCTGCAAGTGCGCTCCAGGTTTCATGGGCGAGTTCTGCGAGAAGAAGATACCGTTCTGCACGAGCGGCTTCAACCCTTGCGCCAACGGAGCCTCGTGCGTGGACCTCGGCAGCCACTACACGTGCGCGTGCCCCAAGGGCTACTCGGGACAGAACTGCACTATCAACGCGGACGACTGTATGAACCACATGTGCCAGAACGGCGCTACTTGTATGGACGGGCTGGACGAGTACCGCTGCGCATGCGCCGCGGGGTACGCGGGCCGGTACTGCGAGGCGGCGCCCCACGCGGCTCTGGGGACTTCGCCCTGCGCTCACCACGACTGTGTGCACGGAGTCTGCTATCTGCCGGCCCTGGCGCTACACGATGACATCATGATGGAGAGACCTCTGCTGGCGCCGCCCGACTACCTCTGCAAGTGCGCGCCGGGATACTCGGGTCGGTACTGTGAATACCTGACCTCTCTGACCTTCAACCACAACGACTCTCTCGTCGAACTGGAACCGCTAAGGACCTCGCCGCAAGCTAACGTCACACTCGTTTTTAGCACAAAACAGTTGCACGGAGTCCTCATGTACTTCGGAGACAACGAACACTTGGCCGTGGAACTGTTCAACGGAAGAATTAGAGTTAGCTACGACGTCGGCAACCATCCCACGTCCACCATGTACAGCTTCGAAATGGTGTCCGACGGTAACTACCACAAAGCTGAGCTTTTGGCCATCAAGAAGAATTTCACTCTCCGCGTCGACGACGGGCCCGCCAGGTCCATCATAAACGAAGGCAGCAACGAATTCCTGCGCCTGGAGCGCCCGATGTTCGTGGGAGGGGTGCCGCCGGATGTCGCCAAGGACGCCTTCAGCAAGTGGCACCTCCGAAACATAACTAGCTTCAAAGGGTGTCTCAAAGAGGCGTGGATCAACCACAAACGTGTCGACTTCGTGAACGCAGCTCGAGCGACTCGGACCACCGCGGGTTGCGGGGGCGGAGGCCTAGCCGGGCCCGGGGCCGAGGAGCCCCCGGCGCCCCCGCACGCTCTCCAGGAAGACGGCGCGCACGAACCAGACCCCTGCGTGCCGAATCCTTGCGCTCGCGGCGGGCGCTGCGTCCGCGAGGCGGGCTCCCGGTCCGACTACACGTGTCGCTGCCGCGCCGGCACCGCGGGGGCGCAGTGCGAGCGCCGAGCGTCTGTTGGCGGTACACCAGTCATCACTCAGTCGAAACTACCGCCTCGAAAACAGGTCATCAACAACAACAACGTCCAAGCATCGCCAGCTGCACCTTCGCCGAAGCAGTATCCAGACCAAGCCTCTGCACCACAAATGCCATCAACTGCAGCCTGCAGAAAGGAAGCGACGCGTGAATTTATAACAGAGGGCTCGTGTAGGAGCCGCAGGCCTGTCCGAGGGGCCCGCTGTACCGCCCGGACTGTGGGGCCCGGAGACAGTGGGGGGGCCTGTCCGCGAGCGACGTGCTGCGCTCCAAGGAAGACTAAGAAAAGGAAGATTCGACTCGTCTGCTCGGACGGCACGCGGTACACCAAAGACATAGAGATAGTGCGGAAATGCGCCTGCGGGAAGAAATGTCCAGCGAGAAACACACCATTCCTACACTAG

Protein sequence:

>DPOGS211056-PA
MAAVINRHWGYIICRKMLLIVLSCVLAVGAACPWACSCRPGAADCAHRALLHAPRRLPVDAHRLDLQGNNISIIFQSDFQNLKELKILQLSENQIHTIERDAFLELNVLERLKLSNNRLGHIPDGIFLRLRHLQRLDLSRNELTAISRRTFRGLTALKSLHLDGNQLKCIDEKALEHLKSLEVLTLNNNNLTYLSLEAVSVARLHTLRLSDNPIVCDCRVARLAAAVRAAGILGLGARCQAPATLRGAMLTELEAQDLICNGPNSIAECSSEPRCPPACRCSTDGTVDCREKLLTELPTTIPHRATEIRLEQNEITEVGAGAFSAVKRVARIDLSNNKIAKMAGDAFNGLTHLTSLVLYGNKIKDLPSGIFHGLTSLQLLLLNSNEISCVRKDTFRDLQSLKLLSLYDNNIRSLPNGTFDSLTGIQTLHLGRNPFSCDCSLRWLGAYLRRNPIETSGAKCDSPKRMNRKRIDALRDENFKCKPGEEPPDACGDAPPCPDTCACSGAGRALRVACARAGLADVPRDLPLTTHALIMPDNNLGQIKSDGLFGRLPDLAKLDLRNNGITVIEDNAFDGAVAMRELSLDGNLLQTVGDKMFFGLHSLTLLSLTDNKIRCITPGSFDHLTMLSTLSLANNPIACNCHMSWLPGWLRGRRLSSGVTCALPLGLRGTELQQLEVVDFKCAPDEQGCLPADYCPERCACAGTVVRCARARLTSLPPRIPPYTTELDLSRNELTAISRRTFRGLTALKSLHLDGNQLKCIDEKALEHLKSLEVLTLNNNNLTYLSLEAVSVARLHTLRLSDNPIVCDCRVARLAAAVRAAGILGLGARCQAPATLRGAMLTELEAQDLICNGPNSIAECSSEPRCPPACRCSTDGTVDCREKLLTELPTTIPHRATEIRLEQNEITEVGAGAFSAVKRVARIDLSNNKIAKMAGDAFNGLTHLTSLVLYGNKIKDLPSGIFHGLTSLQLLLLNSNEISCVRKDTFRDLQSLKLLSLYDNNIRSLPNGTFDSLTGIQTLHLGRNPFSCDCSLRWLGAYLRRNPIETSGAKCDSPKRMNRKRIDALRDENFKCKPGEEPPDACGDAPPCPDTCACSGAGRALRVACARAGLADVPRDLPLTTHALIMPDNNLGQIKSDGLFGRLPDLAKLDLRNNGITVIEDNAFDGAVAMRELSLDGNLLQTVGDKMFFGLHSLTLLSLTDNKIRCITPGSFDHLTMLSTLSLANNPIACNCHMSWLPGWLRGRRLSSGVTCALPLGLRGTELQQLEVVDFKCAPDEQGCLPADYCPERCACAGTVVRCARARLTSLPPRIPPYTTELYLESNEITSISSEQVRHLTQLTRLDLSNNRIAVLSNNTFEGLSKLSTLIVSYNRLRCVQRDALKGLTQLRVLSLHGNNISTLADGVFRDLESISHVALGSNPLYCDCSARWLSEWVKVSGEYVEAGIARCVAPPPMRDKLLLSTATSAFVCNGNPPPEVVSKCDRCYRNPCLNQGTCRSTTSGGFACSCARGFHGETCQYEIDACYGSPCAQGTCQLLEEGRFHCACHAGYTGVRCEVDIDDCVGHRCKNNATCVDHLEGYTCKCAPGFMGEFCEKKIPFCTSGFNPCANGASCVDLGSHYTCACPKGYSGQNCTINADDCMNHMCQNGATCMDGLDEYRCACAAGYAGRYCEAAPHAALGTSPCAHHDCVHGVCYLPALALHDDIMMERPLLAPPDYLCKCAPGYSGRYCEYLTSLTFNHNDSLVELEPLRTSPQANVTLVFSTKQLHGVLMYFGDNEHLAVELFNGRIRVSYDVGNHPTSTMYSFEMVSDGNYHKAELLAIKKNFTLRVDDGPARSIINEGSNEFLRLERPMFVGGVPPDVAKDAFSKWHLRNITSFKGCLKEAWINHKRVDFVNAARATRTTAGCGGGGLAGPGAEEPPAPPHALQEDGAHEPDPCVPNPCARGGRCVREAGSRSDYTCRCRAGTAGAQCERRASVGGTPVITQSKLPPRKQVINNNNVQASPAAPSPKQYPDQASAPQMPSTAACRKEATREFITEGSCRSRRPVRGARCTARTVGPGDSGGACPRATCCAPRKTKKRKIRLVCSDGTRYTKDIEIVRKCACGKKCPARNTPFLH-