Monarch geneset OGS2.0

DPOGS205117
TranscriptDPOGS205117-TA3783 bp
ProteinDPOGS205117-PA1260 aa
Genomic positionDPSCF300172 + 57203-63176
RNAseq coverage342x (Rank: top 34%)
Annotation
HeliconiusHMEL0078570.070.13% 
BombyxBGIBMGA005876-TA0.064.75% 
DrosophilaNdg-PB0.034.79% 
EBI UniRef50UniRef50_D1ZZG70.044.28%Putative uncharacterized protein GLEAN_08043 n=1 Tax=Tribolium castaneum RepID=D1ZZG7_TRICA
NCBI RefSeqXP_972325.10.044.28%PREDICTED: similar to nidogen [Tribolium castaneum]
NCBI nr blastpgi|910808710.044.28%PREDICTED: similar to nidogen [Tribolium castaneum]
NCBI nr blastxgi|910808710.044.36%PREDICTED: similar to nidogen [Tribolium castaneum]
Group
Gene OntologyGO:00071607.7e-33cell-matrix adhesion
GO:00055095e-08calcium ion binding
GO:00055155e-05protein binding
KEGG pathway 
InterPro domain[1035-1260] IPR0110423.3e-58Six-bladed beta-propeller, TolB-like
[308-528] IPR0090173.5e-57Green fluorescent protein-like
[307-533] IPR0066059.5e-48G2 nidogen/fibulin G2F
[91-244] IPR0038867.7e-33Nidogen, extracellular domain
[1144-1189] IPR0000334.5e-12LDLR class B repeat
[569-605] IPR0130911.2e-08EGF calcium-binding
[569-609] IPR0018815e-08EGF-like calcium-binding
Orthology groupMCL13586 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205117-TA
ATGTGGTCAGCTGCGTTGGTCGTGTGCCTTGCGGCTTGCGCGCGCGCAATCACCCGCGATCAATTTTACCCCCACGGTCATGGATTGGACCAACGCTTACCCCGGGGAGCTGAGGTCTCTTCCCCAGAAGTAACCCTGGCTGTACCAGTCGTGTTTTACGGTCAGACCTACGAGTCGATTTTCGTTAATAACTTCGGAGTGCTTTCATTCAGAGCCGACATTCCGACATTTCTGAATGCTGAATTCCCTCTACCATATCCATCGATAGCTGCGTTTTATACCAATATTAATACAACGGATGTTGGCACGGTCTACTATAGAGAAACAAATGAATCCCATGTGCTATTAAAAGCTGAGGAGAGCGTTCAGAATAACTTCCACGATTATTATGACTTCATCCCGACCAGTGTGTTTATTGCCACTTGGATTGATGTCACTTACTCGGGTTCCCAGTGGCAGAATCGCAAGAATAGCTTTCAAATAGCTATCATAAGTAATGGGACGGAAACGTTCGTCGAGCTTTTGTATCCGGAGAGAGAAATTCAATGGATACAACGAGAAACAAAAGATGGCGGCCTACCGGATGCTAAAGCGCAAGCTGGTTTTGTTGCGGAAGACGGACGTGTATTTACTCTAAGAGGTTCTGGCAGTCATCAGATCAGGAATGTCGTTTCTTGGTCAAACATTCATGACCCTGGAAGATACGTCTATCGTGTAGGAAATATCCCATTAGAGGGCACCATAGCCGTTCCTGATCAGTATGATCAATATGAGGCTGAAGTTGAAGAAGAATCTAAAACTTGTGCCCAAAGTGGTCCTAGTGTATGCCACTTGCAAGCAAGATGTGTTGATTATCAAGCCGGTATTTGTTGTCAATGTAATGAAGGTTTTTATGGCAATGGAAAATCGTGCATCAAAGACGATGTGCCGCTACGTGTACATGGAAAAATGAATGGTATTATTAACAATCAAAATTTAAATGATGTTGATATTCAGGCTTATGTGGTTCTAGCTGATGGCAGATCATATACAGCTTTATCTCAGACACCGTCATCTTTAGGTAGCAGTTTGCAACTTCTTAGTGTACTGGGAAGTGTCGTAGGTTGGCTTTTTGCCAAGCCATTAGGCGAAGCCCAAAATGGTTATCAGTTGACTGGTGGTCTTTTCAATCACACCGCAGATATTTACTTTCCTGAATCTGGTGACAGAGTAACAATTAATCAAGAATACGTTGGACATGACGTGTTTGACCAAATAACTTTGGACACTGATGTACGTGGTACAATCCCTAACGTACCTACAGGATCCAGACTAGAAGTGTCTGAATACGATGAGCAATACACTATTGTCGAACCAGGTCTCATTCAAAGTGTATCAACACGGATATTCATGAACAAAATAACAGGACAAAAATATGAACAGAGAGTTTCGCAGACGTTTACTTATAGTCCATGCAAGTTTGCACCACCGTCTGAAAACGCAAATAAGCCGCTAACATTGAAAGTTATAAAAAATTATTTAGGATACGAAACAAGAGGAAATATTGTTCGATATGGAACAACAAATAAAATTCAGTCAAATATTCAAGATCCCTGCGCAGTAGGCAGGAATTCTTGCGGTCCCCATAGCACATGTGTTGTACAGGGTGATTCCTTTGTATGCGTATGTCAATTAGGTTTTAAGAATAATAATGAAAATTGTATTGACATAAATGAATGTGAAGCCGGAACACATAACTGTGACAATAACGCTGACTGTTACAATCAAGATGGCGACTACCAGTGTATATGCCGAGAGGGTTACGAAGGGGATGGAATAAGCTGTAGAAGCATTTCAAATTGTAGGAACAAAGTCTGTGATCAGAATGCTCAGTGCACAGAAAATCCTCTTGAAGGCCCAGTCTGTGTATGTAATCCAGGATTTACTGGAGACGGGGAAAGATGTTGGACCGCATACTATAATGCGTGCATTAACTGCTCCCCAAATGCTCAATGTCGACGGTCAGATGACAGTAATACCGAAAGATGTTATTGCAATCCCGGTTTTATTGGTGACGGACAATCTTGTGTGGAAGAAGTGACAACTGAACCATACGAGCCAGAGACCACCTCAGTGTCTGTTGCTTTTACACAGTCGACTACTACAGTAATACCTGAAAGTGAATACAATCAAACCTATGTTTTACCTAACTGTGATCTTTACGAATGCATTTGTCCTCCAGGATATTCAAGTTTCAAAGATGATAGAAATAACGACCTGTGTCGTCTTGATAACAATGACCAGGAGAATGATTTGGATGAAAACAAATACAACTCGAATTCTATGAGGTGTACTGCGGACGCCGATTGTCCACCAAACGCGGTATGCGCGTTTAGCTACTACTACTCTTCAGACGATTCTGGTTTAGGACATTGTGTTTGTCCGGAAGGATATGAAGGTGACGCATATGAGTGTATTGAAAAAACAGGACCCAGTTGTTCCTGTGGTCCTGCGGCCCATTGTATCGATACAGTAGGCGGCCAGCTCATATGTGTATGCGATGCTGGTTATCATGGGGATGGTTATATATGCCGTCCGAACTTCAGCTGTACAAACAATTCAGACTGTGAATACAACGCTGAATGTCGACCTGATGCAAGCACCAATGAATACGTTTGTCAGTGTATAGAAGGATATGTCAAAGATGAGAGCGATGCGTGTATTAAAGATGGACAGCTCTGTAATGGCGCCGTATGTTCAGAACACGCTTCCTGCTTATACGACGCCGCTATCGATATTAGTTATTGTTACTGTGATGAGGGATATGATGGTGATGGTATTTCTAAATGTGTCCCCAAAGGAAAAACCTGTGACGTTGCCAATGATTGCGATCCGAATGCCATTTGTACTCCAACAGAAATTTCTTATCAGTGTATCTGTCGTGAGGGCTTTACTGGCGACGGCTATACCTGTACCCCAGAAATGAATTGCAAATATAATATATATTTATGTGATGATCATGCTTCGTGTTTAAAGACGAGCGATGGGTATGAGTGCGAGTGTAATACTGGTTATAACGGTAATGGAACTCATTGCCAGCTCAATCCACGACAGGCCGGGAACTTCCTGGTGGCGAGCGATGGCGCTTCCGTTTATCGCGTACCATTCAGAGTGACACCGAGGGAGTTTGCAGCTCCAATAAACAGCGGTGCAATTCAGATAGCTGTGGGCATAGACGTAGACTGTTTGACCGGAAAAATTTATTGGGGAGACGTCAGTGGTGCCACAATCAAACGAGCGTCATATGATGGTTCTGGATTCGAGTCGTTCCTATCAAATGATGTCCAATCGCCAGAAGGTTTGTCAGTGGACTGGTCAGCTAGAAACGTCTTTTGGACGGACTCGAAAAAGTTGACTATTGAGGTAGCCAACATTGACACTAAAATAAGGAAAGTCTTGTTCCAAAGAGAAGGTATACACAATCCAAGGGGTATAGCCGTTCATCCAGGGAAAGGTAAAATCTTTTGGAGCGACTGGAATCGCGGTGGACCAAAGATAGAGTGGGCGAGTATGGATGGTTCTCAGAGGGGTATCTTTTTGGACCAATCAGATGTAAAATTGCCAAACTCATTGGCCATAGATTGGTCCAGAGATAGACTGTGTTACTCCGACGCTGGGTTTGCTAGCATAAAGTGCGTCGGTATAGATACCTTGGAAAAGGAAACCATAGCTGTGAATTGCTCGTATCCATTTGGTTTGGCCATCAGTGGAGATACTTACTACTGGACCGATTGGAAAACGTAA

Protein sequence:

>DPOGS205117-PA
MWSAALVVCLAACARAITRDQFYPHGHGLDQRLPRGAEVSSPEVTLAVPVVFYGQTYESIFVNNFGVLSFRADIPTFLNAEFPLPYPSIAAFYTNINTTDVGTVYYRETNESHVLLKAEESVQNNFHDYYDFIPTSVFIATWIDVTYSGSQWQNRKNSFQIAIISNGTETFVELLYPEREIQWIQRETKDGGLPDAKAQAGFVAEDGRVFTLRGSGSHQIRNVVSWSNIHDPGRYVYRVGNIPLEGTIAVPDQYDQYEAEVEEESKTCAQSGPSVCHLQARCVDYQAGICCQCNEGFYGNGKSCIKDDVPLRVHGKMNGIINNQNLNDVDIQAYVVLADGRSYTALSQTPSSLGSSLQLLSVLGSVVGWLFAKPLGEAQNGYQLTGGLFNHTADIYFPESGDRVTINQEYVGHDVFDQITLDTDVRGTIPNVPTGSRLEVSEYDEQYTIVEPGLIQSVSTRIFMNKITGQKYEQRVSQTFTYSPCKFAPPSENANKPLTLKVIKNYLGYETRGNIVRYGTTNKIQSNIQDPCAVGRNSCGPHSTCVVQGDSFVCVCQLGFKNNNENCIDINECEAGTHNCDNNADCYNQDGDYQCICREGYEGDGISCRSISNCRNKVCDQNAQCTENPLEGPVCVCNPGFTGDGERCWTAYYNACINCSPNAQCRRSDDSNTERCYCNPGFIGDGQSCVEEVTTEPYEPETTSVSVAFTQSTTTVIPESEYNQTYVLPNCDLYECICPPGYSSFKDDRNNDLCRLDNNDQENDLDENKYNSNSMRCTADADCPPNAVCAFSYYYSSDDSGLGHCVCPEGYEGDAYECIEKTGPSCSCGPAAHCIDTVGGQLICVCDAGYHGDGYICRPNFSCTNNSDCEYNAECRPDASTNEYVCQCIEGYVKDESDACIKDGQLCNGAVCSEHASCLYDAAIDISYCYCDEGYDGDGISKCVPKGKTCDVANDCDPNAICTPTEISYQCICREGFTGDGYTCTPEMNCKYNIYLCDDHASCLKTSDGYECECNTGYNGNGTHCQLNPRQAGNFLVASDGASVYRVPFRVTPREFAAPINSGAIQIAVGIDVDCLTGKIYWGDVSGATIKRASYDGSGFESFLSNDVQSPEGLSVDWSARNVFWTDSKKLTIEVANIDTKIRKVLFQREGIHNPRGIAVHPGKGKIFWSDWNRGGPKIEWASMDGSQRGIFLDQSDVKLPNSLAIDWSRDRLCYSDAGFASIKCVGIDTLEKETIAVNCSYPFGLAISGDTYYWTDWKT-