Monarch geneset OGS2.0

DPOGS212974
TranscriptDPOGS212974-TA1287 bp
ProteinDPOGS212974-PA428 aa
Genomic positionDPSCF300057 + 844913-846694
RNAseq coverage92x (Rank: top 62%)
Annotation
HeliconiusHMEL0056422e-16767.84% 
BombyxBGIBMGA011858-TA2e-6474.17% 
DrosophilaBHD-PA1e-6634.98% 
EBI UniRef50UniRef50_B3M8D22e-6534.69%GF25016 n=3 Tax=melanogaster group RepID=B3M8D2_DROAN
NCBI RefSeqXP_001956061.14e-6634.69%GF25016 [Drosophila ananassae]
NCBI nr blastpgi|1947472417e-6534.69%GF25016 [Drosophila ananassae]
NCBI nr blastxgi|1947472412e-6234.69%GF25016 [Drosophila ananassae]
Group
KEGG pathwaydan:Dana_GF250161e-65 
 K09594 (FLCN, BHD)maps-> Renal cell carcinoma
InterPro domain[62-213] IPR0217139.3e-35Folliculin
Orthology groupMCL15197 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212974-TA
ATGAATGCAATTGTTGGTTTTTGTCATTTCTGTGAGGCTCATGGACCAAGACCAGTATTTTGTACATATACTACTGAAGACGAACAACACACCACCGAGCCCTCTAAAAATGCAGTACAGTGCAATGGCTGTACATCTATAGGTTCAGAAATGGTACTAATATCAAGAGATGGTGATACAATCTTTTGTAGCAGGGAATCTGTTCCAAATCCTGAAGTTACCTCTTTTTTGAGACAAGCTGCTTTAAGAAGTATTACTTGTGAAGTAAATTGGAGTAAAGAAGGTGGAGTGGTTTATTTCAGTGATACACAAGGAGATGTGCTCAGCTCTACATTTCAATTAAAGGATACTAGATCAAGGGGATTGAAAAGGTTGTTCTCAATTGTTGTTTTAATGAAGGATAAAATGTTACTATTAAATATTACACCAGTACTCTCAGAACACATGCAGAAAATTGCAAAAGAACTGCAAAATCTAGCTAATGAGGTCTATGAACAAGAACAAAGTATATGCTCTCAAAGAGCTTTAAGGCTTAAGACAGGACGACATGACTTTGGCCAGTCCCGATCCTTAGTACAATTAACAGGAGATGCAGACATTTTTAAAAAACTCCACTCTCATTTTACACACATCCTCAGAATAGGTTCAGTAACTTATTCTGAAACTTTATATACAAGTCATGGCTTACTTAATAAAATCACTCCACAATTAACTAGCAACACAATATTTCAAGAAAATGCCTGCGTTGTTCCTAATAACAACTGCCTGACATTAAGGGAATTAGAAAGTCTATTAACAAAGGAAGCTTTTAAAAAAGTATTGTATTGCATCTTAACTGGAGTTCATATAGTCATAAAATGTGTCAACTTTGAACCAACAAGAATTATAGATTGTTTATTAAAAATAATTCCTAGTACAAGTGTAGATATTAACAGGCCTATAGTCTCGGTTGGTGGAAACAACACTGAAACATATGATAATTGCTGTATAGAAGAAGTAGAAAATAATGATTTTGTTTGTAAATGGAGTGGAAGTTTGCCTGATAAATGCCCCACTTTGATGAGGAGGATAGAAATTGCTATGGATAACCAGAAATTAACAAATGCTGTATTGGACCAACATATAAAATCTCTTCAGTTGGAATGGTTGGGGATAGCGAACACAATCAAAATGGCGAAAACAAATTCAGGAAAATCGGATGCCATAAAAAAATTGAAAGTTGTGTTAGGTGTGATGCAGCAAGATGAAGTACTCGTTAACTACTGGTCGTGCATGTTTTGTAGTTAG

Protein sequence:

>DPOGS212974-PA
MNAIVGFCHFCEAHGPRPVFCTYTTEDEQHTTEPSKNAVQCNGCTSIGSEMVLISRDGDTIFCSRESVPNPEVTSFLRQAALRSITCEVNWSKEGGVVYFSDTQGDVLSSTFQLKDTRSRGLKRLFSIVVLMKDKMLLLNITPVLSEHMQKIAKELQNLANEVYEQEQSICSQRALRLKTGRHDFGQSRSLVQLTGDADIFKKLHSHFTHILRIGSVTYSETLYTSHGLLNKITPQLTSNTIFQENACVVPNNNCLTLRELESLLTKEAFKKVLYCILTGVHIVIKCVNFEPTRIIDCLLKIIPSTSVDINRPIVSVGGNNTETYDNCCIEEVENNDFVCKWSGSLPDKCPTLMRRIEIAMDNQKLTNAVLDQHIKSLQLEWLGIANTIKMAKTNSGKSDAIKKLKVVLGVMQQDEVLVNYWSCMFCS-