Monarch geneset OGS2.0

DPOGS204559
TranscriptDPOGS204559-TA2070 bp
ProteinDPOGS204559-PA689 aa
Genomic positionDPSCF300297 + 424805-435274
RNAseq coverage85x (Rank: top 63%)
Annotation
HeliconiusHMEL0087273e-8357.45% 
BombyxBGIBMGA004329-TA1e-6447.98% 
DrosophilaCg25C-PB5e-3541.81% 
EBI UniRef50UniRef50_E9H0J76e-6334.41%Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9H0J7_DAPPU
NCBI RefSeqXP_002426529.14e-6031.07%collagen alpha-1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3504167451e-6631.28%PREDICTED: collagen alpha-1(XXIV) chain-like isoform 1 [Bombus impatiens]
NCBI nr blastxgi|2420115855e-13237.65%collagen alpha-1 precursor, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00071559.4e-07cell adhesion
GO:00051989.4e-07structural molecule activity
KEGG pathway 
InterPro domain[32-217] IPR0089853.4e-20Concanavalin A-like lectin/glucanase
[408-465] IPR0081601.9e-09Collagen triple helix repeat
[35-208] IPR0031299.4e-07Laminin G, thrombospondin-type, N-terminal
Orthology groupMCL25008 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204559-TA
ATGGTAGTCCAGGGTGTGTTATGTCACTCTGGAATATTTGGGGTTACAGCGTCTCCGTTAAACGCCTGCCAGTCACTTCGTCCGGGCGACATAGACTTCCAATCAGTGGATCTGATAGCGGTGTACCGTTTGGACGGAGCTGATACAACAGGGGTCACCTTGGTCCAGGGCTCTCAGGACCTGCAGAGGGCGTACCGCGTTGGGGATGGAGCGAATCTTACTTTGCCTTTACGCGAGGCTCTACCAGCGGGTTTGCCTTCAACGTTCACTATAACTTCTACCTTCAGAACAAACAATCGCCGACCGTGGAGTCTTATCAGAGTTCGTTCTACTTCTCTTCTTTTCTCTATAACTCTCTTACCAAATGTTAAGAAGATGGCCGTTTTCGTTCAGGGATCACGTGTTGTCTTCTCTACGCCCACACTGTTCAAGCCGTTCTGGCACAAAGTTCATATAGCTATAGACAACGATACTGTACACGCGGCCATAGACTGTAATGAGTTGGAGCCGGAATCAATAGGTGGATGGGACTTTGATAACGCGACCAGCATCAGTATCGTCTCCAACGATGATGGAACCCCAGCTCCTGTAGACCTCCAATGGCTGTCATTGAGTTGCAATCGCTACAACATTACAGAAGACAGTTGTGAAGAAATCGAAATACCGGAGTCACTTATCGCAACCGTCACCCCTCCAATCGTAACACCGGGAGCAAACGACTTTCCTCTTGTATGTAATCAAACATGTCCACCAGGTCCAGTGGGTCCGCCGGGGCCACCGGGAGAGATTGGTCCTCTTGGGTACACGGGACTGCCAGGGAAACGAGGTGTGGACGGGCCTCCAGGCCCCCTAGGTCCGACGGGACCTAAAGGAGAAAAAGGAGACATAGGTCCCCCGGGCTCTGCAAGCAATGTCTCCGTGATTGGCCCGCCAGGGGTACCTGGGAGAAAAGGTTCCAAAGGTGACAAAGGAGATTCGGGAGAAAAAGGTGATAGAGGTGATGTGGGCCTGGTGGGCCTGGCTGGAGTCCCCGGTGTCGATGGGAAGGATGGTCCACCAGGTCCCGTCGGTCCTCCAGGGGCTCCCGGTGAACCTGGCCCTGTGGGCCCCCCAGGACCCGCCAGTAAAGGATTTCTTCCTCTTGTGCAAGGTAGTAAGGGCGAGCAAGGAATTCCGGGGGAGCCTGGTAGAGATGGCTACCCCGGAGTGAGAGGTTTACCAGGATTAGATGGAACACCGGGAAGCCCGGGTATCCAAGGGATGCAAGGGTTACCTGGATTGCCCGGAGAGAGAGGCTTGATTGGTCTTCCGGGTACTCCGGGAGAAATGGGCCCTGAAGGTCCAGCGGGACCCCAAGGTCCGCCAGGACTTCCAGGACCTGCCGGTCCTCCAGGTGTGAGTACATCAACAGCTGGTGTTACCGTTCCAGGCCCTCCAGGGCCACCGGGTTTAATGGGTCTGAAAGGTGAACAAGGATTCCCAGGATTGCCAGGGCGAGATGGCTTAGACGGTATCCCTGGGCTACCCGGGCAGAGGGGTCCTCCAGGGCCCCCCGGTTCACTTAGCTTAGTACAAGAACAACGTCCATCATTATCAGAAAACGACGTGAGAAACATCTGTGAGGACATAATAAAAGTGCGTCTGGCTGACTTTTCATCGGGTCTTGTGATGCCAACTGCGAAACCGGGACGCAGAGGCCCCCCAGGACCGCCGGGGGCACCAGGGAGCCCCGGTTCCGTGGGCGAAACAGGACCGATGGGTCCCAGGGGGTATCCGGGTGAAACAGGCGAACCTGGTCGACCAGGATATCCAGGACCCAGCGGCGACAAAGGAGACAAAGGGGATAGAGGTCCTCAAGGAGTGGGCATCCCAGGTCCAGAGGGATCACCCGGGATGACTGGTCCCATGGGTCCCGCTGGGATCGAAGGAAGAACTGGACCTCGGGGTGATCCGGGCCCGTCAGGTCCTGTAGGCCCACGGGGAGTTCCAGGTCCAAGAGGAAGCTGCGACTGTTCATCATCAGCGTATTACGCGTACGCGCCCATTTTAGGGAACAACAAAGGACCCTAG

Protein sequence:

>DPOGS204559-PA
MVVQGVLCHSGIFGVTASPLNACQSLRPGDIDFQSVDLIAVYRLDGADTTGVTLVQGSQDLQRAYRVGDGANLTLPLREALPAGLPSTFTITSTFRTNNRRPWSLIRVRSTSLLFSITLLPNVKKMAVFVQGSRVVFSTPTLFKPFWHKVHIAIDNDTVHAAIDCNELEPESIGGWDFDNATSISIVSNDDGTPAPVDLQWLSLSCNRYNITEDSCEEIEIPESLIATVTPPIVTPGANDFPLVCNQTCPPGPVGPPGPPGEIGPLGYTGLPGKRGVDGPPGPLGPTGPKGEKGDIGPPGSASNVSVIGPPGVPGRKGSKGDKGDSGEKGDRGDVGLVGLAGVPGVDGKDGPPGPVGPPGAPGEPGPVGPPGPASKGFLPLVQGSKGEQGIPGEPGRDGYPGVRGLPGLDGTPGSPGIQGMQGLPGLPGERGLIGLPGTPGEMGPEGPAGPQGPPGLPGPAGPPGVSTSTAGVTVPGPPGPPGLMGLKGEQGFPGLPGRDGLDGIPGLPGQRGPPGPPGSLSLVQEQRPSLSENDVRNICEDIIKVRLADFSSGLVMPTAKPGRRGPPGPPGAPGSPGSVGETGPMGPRGYPGETGEPGRPGYPGPSGDKGDKGDRGPQGVGIPGPEGSPGMTGPMGPAGIEGRTGPRGDPGPSGPVGPRGVPGPRGSCDCSSSAYYAYAPILGNNKGP-