Monarch geneset OGS2.0

DPOGS204082
TranscriptDPOGS204082-TA1119 bp
ProteinDPOGS204082-PA372 aa
Genomic positionDPSCF300200 + 282810-287033
RNAseq coverage43x (Rank: top 72%)
Annotation
HeliconiusHMEL0131462e-3649.66% 
BombyxBGIBMGA010817-TA3e-6564.84% 
DrosophilaCG42780-PA7e-1729.91% 
EBI UniRef50UniRef50_Q7YW393e-1930.88%Antigen 5-related salivary protein n=8 Tax=Culicidae RepID=Q7YW39_ANODA
NCBI RefSeqXP_316459.34e-2030.40%AGAP006421-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1148645642e-2031.91%gvag protein precursor [Anopheles funestus]
NCBI nr blastxgi|1148645646e-2031.78%gvag protein precursor [Anopheles funestus]
Group
KEGG pathwaytca:6563593e-07 
 K02330 (POLB)maps-> Base excision repair
InterPro domain[11-210] IPR0140441.2e-27CAP domain
[6-258] IPR0012833.4e-26Allergen V5/Tpx-1-related
Orthology groupMCL20381 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204082-TA
ATGCGGAGTGACGACCAGACATATTGCACTTTGAGATATCGAAGGCTTTGTATGGGGAAGGGATCACACGTCGCTTGTCAATTTCCTTTGGCGGGCGCGGGGGCTTCGTGTAGCAACTACACTAAAATCAAATTCACTAATGTTTTAAAGCATTTCGTGACGAGTTATATAAACAGACGACGTCAGCGGATAGCGTCGGGTTCCGAACGTGTCCGCGGCGGTGCTCCTTTACCGCGACCGGAAGTATGGGACAAAGAGCTAGCCTTTCTGGCCCAAAGGCTGGCAGACCAATGTAACTTCGTTCATGATGATTGCCGAGCTACAGTTCGTTATCCTTACGCTGGTCAGAGCGTGGGTGAAGTGCATTGGAGAGGTACAGAGGAGCTCAGCCTCCAACGAGCGATCAGACGCGTGTTGGACGCCTGGTGGGGGGAGAGGAGACGGGTCCAGCCGGAACAGCTCATAACCCCCTTCAGACTTACTAACAAAGGCAGTGTTTGGGGTCACTTCAGCCAATTGGCGGTGTGGTCTCTTAGGGCTGTCGGTTGCGGTGCCGTCATCCACGGATGGGATTATACTCGCCTGTTGTTAGTCTGCGACTTCTCTCACACCAACATGTTGGGACAGAGGACCATATCCCCGGGACCTCTGGCCCCGTGTCCGATACACACTGTGAGGAAACAAAGAAGTCCTTATCCTTTATTGTGTGCTCCCATTAAGCGATCCTTAGACACCGAAAATGAAGAAAACGATCTTAACAACGATTATCAGAACACACCAGACTATGACGGGATACGTGACACGGAAATAACAAAAAGATATTCTATGAACACGTATAAATATGACGAAACAACTAAAAAGAATATGCTGTCAATTCCAAGGAGATATACATGGCTGAAAGATTCAGAAATAAGTGATCAGAAAACGTCTGATTTGCTGAAACACAGGATCAAACAAATGAAATTGTGGAATAGTGTGAGGACGAGTATAAATTTGAGAAAGTATAACCGATGGGAGTCATGGCCGACACACGCAGACATGAGACCTGGAGCAAAGGCTTTATTGAATAAGCCGCTCGGAAAACTTCCCCAAAAGCCAAGCATCGTAAGATCAGATTAA

Protein sequence:

>DPOGS204082-PA
MRSDDQTYCTLRYRRLCMGKGSHVACQFPLAGAGASCSNYTKIKFTNVLKHFVTSYINRRRQRIASGSERVRGGAPLPRPEVWDKELAFLAQRLADQCNFVHDDCRATVRYPYAGQSVGEVHWRGTEELSLQRAIRRVLDAWWGERRRVQPEQLITPFRLTNKGSVWGHFSQLAVWSLRAVGCGAVIHGWDYTRLLLVCDFSHTNMLGQRTISPGPLAPCPIHTVRKQRSPYPLLCAPIKRSLDTENEENDLNNDYQNTPDYDGIRDTEITKRYSMNTYKYDETTKKNMLSIPRRYTWLKDSEISDQKTSDLLKHRIKQMKLWNSVRTSINLRKYNRWESWPTHADMRPGAKALLNKPLGKLPQKPSIVRSD-