Monarch geneset OGS2.0

DPOGS210720
TranscriptDPOGS210720-TA2244 bp
ProteinDPOGS210720-PA747 aa
Genomic positionDPSCF300013 - 121117-127080
RNAseq coverage220x (Rank: top 45%)
Annotation
HeliconiusHMEL0161700.092.40% 
BombyxBGIBMGA006332-TA0.086.86% 
DrosophilaCG42613-PE0.061.62% 
EBI UniRef50UniRef50_B0W9D00.056.71%Putative uncharacterized protein n=1 Tax=Culex quinquefasciatus RepID=B0W9D0_CULQU
NCBI RefSeqXP_321719.40.062.15%AGAP001415-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479658480.062.15%AGAP001415-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479658480.062.05%AGAP001415-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain[146-265] IPR0008591.6e-26CUB
Orthology groupMCL15821 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210720-TA
ATGTTTGTAGAGTGCGACCGTACCTTCGTGAGTCGCGGGGGAGCCAACAACGGTACATTCCACGCCCCGGAGCTGCTGAATCCCAACAACCACAGCCGGCAGTGTCTCTACACCTTCCTAGCAGCACCAGGGCAGCGCGTCCTCGTCGAGTTCAGAACCTTCGACCTCAGGGGCAAACCTCCAGAATGCATGCACGAGTACATGGACATCTACTCGGAGATGGCGTCTCTGGAGGCGGGCGAGTTAGTGAACTCAGCCTTCGGTGGACGCTACTGCGGCCCGATCCCTCCACGGAGGCGGGTGTCGCTACACCGGGCGGTCGCTCTCGCTTTTTTTACAGATCACACTTACACACCACCAACACTGTTCGCGGGAACGTATCGTTTCATCAACGCCTCGGAATACGAAGTGGGAACTCCGGTACCCAACACACTGTGCTCGTTCGTGATCGAGGCCAGCAAACGCAAAACTGGTCTCCTTTTGTCGCCTACGTATCCGGGACTATACCCGAAGGACACCACCTGCAACTATCAATTCGTGGGACAGCCCGGCCAGCGAATACGACTCGAGTTCAGAGACTTCGACCTGTTCTTCGGGGGTCCCCATTGCCCGTTCGACTGGGTGCGCGTATACGACGGGCCCGACAACTCCTCGGCGGCCGTGGGCACGTACTGCGGACAACAACGCAACCTCGTCCTCTACTCCAGCGACGAGAGGCTGCTAGTAACTTTCTTTACTCTGCCGCGGTCAGCCAACACCCAGAATCGAGGGTTCAAGGGTATCTTCGAGTTTTCAGAGAGTTTTGTTAAATTAGATTTTATAAGCAAACACGACGCCGAGCACATCAGGGGCTCGGAGTGCGATCAGAAAATACTTAGCAAAAAGGAATCGACAGGATTCGTTTATCACCCAAATTATCCGTTTCCTTATATACAAAAAGTTGTTTGTAGATATTTTATATATGGTATGCAAGACTCGCAAAATTTAGAAAGAGTTCGATTGGAGTTCCAGAACTTTTCGATACCGAAGGGAGAAAATCCTAAACCGGATTCGTGTCCCGATGGTTACCTGAAGGTGTATCTGCGCGGTCAGGAGGCGACTGATTCCTACGACAAACACGACGCGGAGCTGTGCGGCGAGGGCTCGGCTGGCCCGCCCGCCTTCCCGCCCCCGCTGCTGTCTGACGGACCGAGGCTTGTCATGGTCTTCAGCTCCGGAGAATTGCAGGGCAGAGGATTTAAAGCCAAATATACTTTTGAGACTGAATATCGAGTTCCAGGGACAGCGGCGCCGGGTGGCGAGTGCGCCTTTACATATCGATCCGAAGCGAAGAAGAGAGGAGAGTTCAACTCCCCGCGGTATCCCTCCAACTATCCCTCGCGCACCAACTGCACCTACACCCTGGTGGCCACTCCCAACGAACAGGTCACGGTCGTGTTCGATCACTTCAAGGTCAAAGCGGACACCTGGAACGCTACCGCGGGCTTGTATGGCGGCGCAACATGTATCGAAGATTGGGTGGAGGCGTGGTGGACGGGACGAGAGGGCTCGCGAGTCCCTTTGGGACGCTGGTGTGGCCCCGCTACCCCCGGACCCTTGCAATCACCTCGCGGAGCCCTGGGCCTGCTCATCGCTTTGCACACCGACTACGACTCGGTCGCTTCAGGATTCAAGGCGAGATATGTATTTGAGCCAGCCAAATCAATCTTCGGCGACTGTGGCGGCAACGTATCAGGATCGGCGTGGGGGGTGGTGTCTTCCCCGCGCTACCCGTTACCGTACGAGGCGCCCTCGAGAGGAGCCGCCTCCAGGGTGTGCAACTGGTTCATCACTGCACGTCCGGGGAAACGACTACTCATCAACTTCGATCAGTTCGCCGTTGAGGGACATCTTACGGAGCGTGGTTGTCCGGCGGCGGTGCTCCGCCTCTGGTACGAGTCCCCTGGCCCGCCGCTGGAGCTGTGCGGGGAGAAGGCGCCCGCCGACCGCTGGCAATACCTCTCCTCCTCAAACTCCATTAGACTTTCATTCATCATAGCCGACAAGTCAGTAGGCGCCGCGGGGTGGCGCGCGATATGGACCGAGGTGACTGTGGGCCCCCCCGGGGGATCGGGCGTGTCGGGTTCGGGTGGGGGGGAGGAGTGCCCCGCTGTGTGCGGGGGGGCTTGCCTACCACCACGCGCCGCCTGCTCCGGCTTGCAGCACTGCGCCGGCCAGCCCACCACTCAGCCCTCATACTGTTAG

Protein sequence:

>DPOGS210720-PA
MFVECDRTFVSRGGANNGTFHAPELLNPNNHSRQCLYTFLAAPGQRVLVEFRTFDLRGKPPECMHEYMDIYSEMASLEAGELVNSAFGGRYCGPIPPRRRVSLHRAVALAFFTDHTYTPPTLFAGTYRFINASEYEVGTPVPNTLCSFVIEASKRKTGLLLSPTYPGLYPKDTTCNYQFVGQPGQRIRLEFRDFDLFFGGPHCPFDWVRVYDGPDNSSAAVGTYCGQQRNLVLYSSDERLLVTFFTLPRSANTQNRGFKGIFEFSESFVKLDFISKHDAEHIRGSECDQKILSKKESTGFVYHPNYPFPYIQKVVCRYFIYGMQDSQNLERVRLEFQNFSIPKGENPKPDSCPDGYLKVYLRGQEATDSYDKHDAELCGEGSAGPPAFPPPLLSDGPRLVMVFSSGELQGRGFKAKYTFETEYRVPGTAAPGGECAFTYRSEAKKRGEFNSPRYPSNYPSRTNCTYTLVATPNEQVTVVFDHFKVKADTWNATAGLYGGATCIEDWVEAWWTGREGSRVPLGRWCGPATPGPLQSPRGALGLLIALHTDYDSVASGFKARYVFEPAKSIFGDCGGNVSGSAWGVVSSPRYPLPYEAPSRGAASRVCNWFITARPGKRLLINFDQFAVEGHLTERGCPAAVLRLWYESPGPPLELCGEKAPADRWQYLSSSNSIRLSFIIADKSVGAAGWRAIWTEVTVGPPGGSGVSGSGGGEECPAVCGGACLPPRAACSGLQHCAGQPTTQPSYC-