Monarch geneset OGS2.0

DPOGS211471
Transcript: DPOGS211471-TA 579 bp
Protein: DPOGS211471-PA 192 aa
Genomic position: DPSCF300113 - 294845-297295
RNAseq coverage: 121x (Rank: top 57%)
Annotation
Heliconius: HMEL010928 1e-22 91.23%
Bombyx: BGIBMGA002746-TA 6e-34 60.69%
Drosophila: CG42342-PH 1e-33 55.56%
EBI UniRef50: UniRef50_E0VQF7 8e-36 61.38% Collagen alpha-1, putative n=20 Tax=Coelomata RepID=E0VQF7_PEDHC
NCBI RefSeq: XP_002428351.1 2e-36 61.38% collagen alpha-1 precursor, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|270001046 6e-36 90.80% hypothetical protein TcasGA2_TC011335 [Tribolium castaneum]
NCBI nr blastx: gi|270001046 7e-45 70.40% hypothetical protein TcasGA2_TC011335 [Tribolium castaneum]
Group
KEGG pathway: dre:492459 6e-08
 K10066 (COLEC11) maps -> Phagosome
InterPro domain: [51-108] IPR008160 6.8e-09 Collagen triple helix repeat
Orthology group: MCL27587 Insect specific

Nucleotide sequence:

>DPOGS211471-TA
ATGACAAGTGTTCAAGCACTGAGAAGCCACATATTTCTTGCCGCCAAGCTCGGTGGTGATTGGTATGTGGTGGTAGGTGGTGGTCCCAAAGCGGTCACTCGCCGCGACCGACCCAGCTGTCCCGAGTGCTCCGTCCGCACTGGAGCGCAGGGCGAGCCGGGCAGCAAGGGTGAGCGAGGAGATCCTGGACTACCCGGAACAGATGGAATTCCAGGACAAGAGGGTCCGAAGGGTGACAAAGGCTATAAAGGAGAACCCGGACCAGGCGGAAAACGCGGCCGTAAGGGTGACAAAGGTGACCGTGGGGAGCAAGGAGTTCCGGGACTGGACGCACCCTGCCCGCTAGGACCAGACGGACTGCCACTGCCGGGATGCGGCTGGCGACCCTCGAAGGAAGTGGCGCGGGAGGAGCGGCTGGGAGGAGGAGGTGACGGGACGCGCTCGGAGGACGACGCGGAGGAAGAAGATGCGGAGCCAGAAGACGAGGGCGGTGACTATGAAGGGAGAGACGACCTCGAGCCGCCGAGAGACTACGACGACTACACAGACAACGCGCATCACGACTCGCACCGGGACTGA

Protein sequence:

>DPOGS211471-PA
MTSVQALRSHIFLAAKLGGDWYVVVGGGPKAVTRRDRPSCPECSVRTGAQGEPGSKGERGDPGLPGTDGIPGQEGPKGDKGYKGEPGPGGKRGRKGDKGDRGEQGVPGLDAPCPLGPDGLPLPGCGWRPSKEVAREERLGGGGDGTRSEDDAEEEDAEPEDEGGDYEGRDDLEPPRDYDDYTDNAHHDSHRD-
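
The transcript and protein lengths listed above are consistent with each other: 192 codons plus one stop codon give 3 x 193 = 579 bp. The relationship between the two sequences can be checked with a minimal Python sketch, assuming the standard genetic code and the record's apparent convention of writing the stop codon as "-". The function name `translate_cds` and its parameters are illustrative and not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1 (hypothetical helper).

    Stop codons are written as `stop_symbol`; codons containing
    characters other than A, C, G, T are written as "X".
    """
    cds = cds.strip().upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First eight codons of DPOGS211471-TA followed by its stop codon.
fragment = "ATGACAAGTGTTCAAGCACTGAGA" + "TGA"
print(translate_cds(fragment))
```

Passing the full DPOGS211471-TA sequence to `translate_cds` and comparing the result with the DPOGS211471-PA sequence is one way to confirm that the two entries agree.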