Monarch geneset OGS2.0

DPOGS206024
TranscriptDPOGS206024-TA1119 bp
ProteinDPOGS206024-PA372 aa
Genomic positionDPSCF300028 - 1760405-1775973
RNAseq coverage1672x (Rank: top 8%)
Annotation
HeliconiusHMEL0109018e-11875.00% 
BombyxBGIBMGA000552-TA2e-11073.71% 
Drosophilafax-PA3e-11558.19% 
EBI UniRef50UniRef50_Q240363e-11358.19%Failed axon connections protein n=39 Tax=Pancrustacea RepID=Q24036_DROME
NCBI RefSeqXP_968396.22e-12161.41%PREDICTED: similar to failed axon connections protein [Tribolium castaneum]
NCBI nr blastpgi|2700030725e-12261.93%hypothetical protein TcasGA2_TC000100 [Tribolium castaneum]
NCBI nr blastxgi|2700030729e-13062.07%hypothetical protein TcasGA2_TC000100 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[68-169] IPR0123362e-13Thioredoxin-like fold
[178-304] IPR0109872.5e-12Glutathione S-transferase, C-terminal-like
Orthology groupMCL14374 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206024-TA
ATGGCTACTGAAGTTATTAATAATGTACCCGAAAATACTGAAAAGGAGAAGGAAGTAAAAGACGACGCCTCGCCAGATAATGGGCAAACAAAAGAAGCTCCACCCGAAGGGGAAAAAGCTGAAGGGAAAGTAGAAGAAAAAAAGGCTGAAGTGGTACAGCCAAAACCCAATGTTCACAAAATTAATTTCGAAAAAGATATCGTTTATTTGTACCAGTTTCCGCGAACGCCTTTGTTGCCGTCAACTTCGCCCTACTGCTTGAAACTTGAAACATGGCTCCGTCTAGCCGGCATTAAATACGAAAACGTCGATCACAAGGCAAAATTCCGCTCCAAAAAGGGCCAGCTTCCATTCGTTGAATTGAATGGAGAAGAAATTGCTGATAGCACCTTCATCATCAAAGAGCTCTCGGAGAAATTTACTAAAGATTTAGATGCCGGTCTGACCCCCGATCAGCGGGTGGTTGCCCACGCCATGGCGTCTATGATTGAAAACCATCTTTCTTGGGTGATTTTCTGGTGGCGTGCTAAATACCCTGACAGTATGATCAAGGGTTACCAAGTTAATTTGCAGAACGCTCTAAACACGCGCCTGCCGAATCCTATTCTGAACTTCTGTTACAAGTTCTCCCTTGGACGTAAGGGGATGAAGAAGGCGAAGGCCCATGGGATCGGCGTTCACAGCCAAGACGAGATCATCGAACTCGGCAAGAACGATCTTCGCGTACTCTCCGATCTGCTCTCCGATAAGCCATACTTCTTCGGAGACGAGCCCACCATTCTTGATGTAGTTGCTTTCGCCAACTTGGCCCAATTGCACTTCATCGACAAGGATGTGCAGCATGCCTTGCGGGACGCTCTGGCTGAGTCCTTCCCCAACCTCGTTGGACTGGTCACACGTATCAAGGAGCGAGCCTATCCCGATTGGGACGAACTCTGGTCGTCCACCGAAAAGAGCGCCAAGGAAGCAGAGAAGCCCGCTGATGACGCCGAGAAGGGCAAGGGCGACGAGCAAGAGAAGGAACTAGAGAAAGAGCCGGAGGAGAAAGAGAAGGAAAAAGAGGAAAAGGAGAAAGAGAAGGAAAAGGAGAAAGAGAAGGAGAAGGAGAAAGAAAAGTAA

Protein sequence:

>DPOGS206024-PA
MATEVINNVPENTEKEKEVKDDASPDNGQTKEAPPEGEKAEGKVEEKKAEVVQPKPNVHKINFEKDIVYLYQFPRTPLLPSTSPYCLKLETWLRLAGIKYENVDHKAKFRSKKGQLPFVELNGEEIADSTFIIKELSEKFTKDLDAGLTPDQRVVAHAMASMIENHLSWVIFWWRAKYPDSMIKGYQVNLQNALNTRLPNPILNFCYKFSLGRKGMKKAKAHGIGVHSQDEIIELGKNDLRVLSDLLSDKPYFFGDEPTILDVVAFANLAQLHFIDKDVQHALRDALAESFPNLVGLVTRIKERAYPDWDELWSSTEKSAKEAEKPADDAEKGKGDEQEKELEKEPEEKEKEKEEKEKEKEKEKEKEKEKEK-