Monarch geneset OGS2.0

DPOGS210641
TranscriptDPOGS210641-TA1734 bp
ProteinDPOGS210641-PA577 aa
Genomic positionDPSCF300401 - 203272-208167
RNAseq coverage112x (Rank: top 59%)
Annotation
HeliconiusHMEL0107837e-5928.62% 
BombyxBGIBMGA001632-TA1e-9642.15% 
DrosophilaCG6654-PA8e-2328.21% 
EBI UniRef50UniRef50_E0VND54e-2427.70%Gonadotropin inducible transcription factor, putative n=2 Tax=Neoptera RepID=E0VND5_PEDHC
NCBI RefSeqXP_002427629.17e-2527.70%gonadotropin inducible transcription factor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420138821e-2327.70%gonadotropin inducible transcription factor, putative [Pediculus humanus corporis]
NCBI nr blastxgi|1953956803e-2930.58%GJ10213 [Drosophila virilis]
Group
Gene OntologyGO:00036769.4e-06nucleic acid binding
KEGG pathway 
InterPro domain[465-504] IPR0130879.4e-06Zinc finger, C2H2-type/integrase, DNA-binding
Orthology groupMCL26726 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210641-TA
ATGTCCAACGTCTTGTGTTTTGTATGCTACGGCGCAGTTCATTCGGATATCAGTGACGGAACACGAGCTAAATACCGCGACTTCGTCGGCATTAGCCTGTGTCCGGAATCCCAGCTGTGCTACATATGCTGTCACATACTCAATAAGATGTGTATTTTCAAATCGCTGTGTCTTAAAAGGAGCACAGACTATCCTATATTTAGCGAAAAAGATATACTAAGATTACATATAACCGAAGTAAAGACACAAACGATTTGCGATGATGAATGCTGTGAACAACTTAAGGTAGATATAAAGAATTACAATGACAACTACGATGAAAATCTATGTGGGAATAACAAAGATATTGGTTCTGATGAGGATATTAACGGGTACGGCACAGACGATGATAATAATGGTGACATTAATTACTACGAATATAACATCAGTGATGCCTATCATGGCAATGAGGATGGAAAACAAAATGGAGGACATACAGAGGTGGGTTTGAAAGAAGATGTCGTGTTGGAGGATACATTAAATAATGATGTCAACGAAAATGACAGAAATGATAGTAATGATGGTAATGATGATAATGATGACCGGAATCATGATGTAATTAACAAGGATGATGTTAACGTGAGCGATGTGAAGAAAATGAGAAAGAATAGATTAAAGAAATCAAAAAGAAGGGGACTTATGAAGATAACTCTGACAGTTGAAGAACAGAGAGCTGAACTAGAAGCGAAGAGGAAGGAAAAGAAGTACACAGAGGCTGAGTTCAAGTGCTATAATTGTGCTATAGGATTCCTGTTCAAGGATACCTACCAGGCGCATATGATGCGTCACGAAGAGTCTAACGGTCAATACTCGTGTCCGATATGCACCCTCCGCGTGTCGTCTCTAACGCTGCTCCGCGTCCACGCCTCCCGTCATGCGGAGCGCTCCGTGTGCGTGAGGTGCGGGGTTCGAGTCCCCGGCAGGCACCACGTATGTAAGCACACGAGGACCAGGTCCCTGCCCTGCCACATGTGCGCTAGACTGTTCACGGACGCGAGTGGTCTCCAACAACATTTAAAACGAGTCCACACCAGCAAGACCAGCGGCAGACTCCACACCTGTACCGTCTGTGGCGAGACTTACAACACGCAGGCGGCTCTGAGGACGCATATGATTAAACATATAAAACGAAAATTCCCGTGTGAGCTGTGCCCGTCGGTGTACAGCAGTCCGTACACCCTGAACCAGCACATGAAGACCCATAACCAGGTGTCGGAGACATACTACTGCGAGACCTGCAACGTCAGCTTCACCTCCAGGAAGGGGCTGATGGCTCATAGACGGAACACGCTCAAACACCAACAGACCCTCTTCGAGTGTCCGATATGTGGTCGAGTGTGTCCCAACCAGCGAGCGCTGGCCTCACACATCCAGGCCGTCCACTCGTCCAGCAAGGAGTACAGCTGTTCCATGTGCAGCTCCAGCTACACTAGCAGGAAGTCGCTGGTCAGACACGTCGGAACGCACAGGAACAGCACAGGCGGGCCGCTGGCTGTGTGTCACCTGTGCGGGAACTGTTTCAAGGTCGGCCATTACGGGTTATATGTATGGAGCGTTTTATTTGAGAGTTATTTTATGTGTGATAACGACCTTGAAGGTCCAACTCAAGCAGAAAGACGCGACAATAATGCATCTCACGAAACGCCTGGCCCTTCTGGAAAGCCAGATCAAAAGCTGCCCCACATGTACAAATAA

Protein sequence:

>DPOGS210641-PA
MSNVLCFVCYGAVHSDISDGTRAKYRDFVGISLCPESQLCYICCHILNKMCIFKSLCLKRSTDYPIFSEKDILRLHITEVKTQTICDDECCEQLKVDIKNYNDNYDENLCGNNKDIGSDEDINGYGTDDDNNGDINYYEYNISDAYHGNEDGKQNGGHTEVGLKEDVVLEDTLNNDVNENDRNDSNDGNDDNDDRNHDVINKDDVNVSDVKKMRKNRLKKSKRRGLMKITLTVEEQRAELEAKRKEKKYTEAEFKCYNCAIGFLFKDTYQAHMMRHEESNGQYSCPICTLRVSSLTLLRVHASRHAERSVCVRCGVRVPGRHHVCKHTRTRSLPCHMCARLFTDASGLQQHLKRVHTSKTSGRLHTCTVCGETYNTQAALRTHMIKHIKRKFPCELCPSVYSSPYTLNQHMKTHNQVSETYYCETCNVSFTSRKGLMAHRRNTLKHQQTLFECPICGRVCPNQRALASHIQAVHSSSKEYSCSMCSSSYTSRKSLVRHVGTHRNSTGGPLAVCHLCGNCFKVGHYGLYVWSVLFESYFMCDNDLEGPTQAERRDNNASHETPGPSGKPDQKLPHMYK-