Please wait while HSF is performing calculation...

Start an Analysis

Select an analysis type

Ensembl database release: 70 (February 2013)

You have to choose an analysis mode prior to process your sequence(s). Please find below a short description of all analysis modes available in HSF:

  • Analyze a sequence. Either enter your own sequence or select a given gene and find splicing elements.
  • Analyze mutation(s). Visualize the impact of a given mutation on splicing elements of either your own sequences or a selected transcript.
  • Branch point analysis. Enter your own sequence and find potential branching points.
  • Splice site analysis. Either enter your own sequence or select a given gene and find potential acceptor and donor sites for splicing.
  • Multiple data extraction. Enter a list of Ensembl transcript ID associated with mutations and proceed to analysis.
Automatically select the longest transcript: Yes No
Search for SNPs related to the analyzed sequence: Yes No
More database options

Choose a sequence by

You can query HSF's database by using different sequence types:

  • Gene name. The name of the gene, as stated by the HUGO Gene Nomenclature Committee.
  • Ensembl transcript ID. The transcript identifier given by Ensembl.
  • Ensembl gene ID. The gene identifier given by Ensembl.
  • Consensus CDS. The Consensus CDS shared by Ensembl and the NCBI.
  • RefSeq Peptide ID. The peptide identifier given by NCBI. Please note that the processed sequence comes from Ensembl, not NCBI.
  • Pasting your own sequence. Paste directly your sequence in the following box(es). All caracters that do not match the nucleotide requirements (A, C, G or T) will be eliminated prior to analysis.


Detailed help

Select an analysis type

Analyze a sequence

Either enter your own sequence or select a given gene and find splicing elements. This mode allows you to perform a complete analysis on a single sequence. It can be followed by the visualization of the impact of a mutation (substitution, insertion, deletion) or the impact study of SNPs related to this sequence.

Analyze mutation(s)

Visualize the impact of a given mutation on splicing elements of your own sequences or a selected transcript from the database (Ensembl 70). This mode allows you to either perform a complete analysis of several mutants against a transcript stored in our database (batch analysis) or compare two sequences.

Each mutant must be described at the cDNA level according to the mutation nomenclature. This application currently supports:

  • substitutions - c.10019G>A
  • insertions - c.1542_1543insA
  • deletions - c.2230_2231del
  • duplications - c.5434_5437dup
  • indels
  • inversions

Mutants must be separated by either a space character or a breakline character. The system will automatically and independantly for each mutant check for the proper exon and introns.

This mode can be followed by a SNP search or a "quick mutant" analysis on either the reference or mutant sequences or both.

Branch point analysis

Enter your own sequence and find potential branching points. This mode can be followed by a SNP search or a "quick mutant" analysis.

Splice site analysis

Either enter your own sequence or select a given gene and find potential acceptor and donor sites for splicing. This mode allows you to perform an analysis on a sequence to find potential donor and acceptor splice sites. It can be followed by a SNP search.

Multiple transcript analysis

Enter an Ensembl transcript ID followed by three slashes and a mutant written according to the international mutation nomenclature (at the cDNA level). Different mutants must be separated by either a space character or a breakline character.

Choose a sequence

Gene name

Please note that if you have difficulties finding a gene using a gene symbol, you can check the HUGO Gene Nomenclature Committee for its proper symbol.

Ensembl transcript ID

Select a transcript corresponding to its Ensembl transcript identifier.

Ensembl gene ID

Select a gene corresponding to its Ensembl gene identifier.

Consensus CDS

Select a transcript corresponding to its Consensus CDS. NCBI and Ensembl share the same sequence for this transcript.

RefSeq Peptide ID

Retrieve in Ensembl's database a gene/transcript corresponding to the NCBI RefSeq Peptide identifier.

Paste your own sequence

Instead of using our database, you may wish to paste our own sequences. Before analyzing them, the system will clean your sequences and remove all characters that aren't A, C, G or T.

Options

Automatically select the longest transcript

If you are sure that you must perform an analysis on the longest transcript of a given gene, this option will force the system to choose the longest transcript for you. Please note that if at least two transcripts match the requirement of being the longest, a screen will be displayed for you to choose the correct one. This option doesn't work if you use a Ensembl Transcript ID as a sequence identifier. Default value: Yes.

Single Nucleotide Polymorphisms

Human Splicing Finder is able to check on the Ensembl variation database for SNPs related to the analyzed sequence. Default value: No.

Nucleotides surrounding the exon

You can decide to display intronic nucleotides surrounding the exon. By default HSF will use a value of 0. Intronic nucleotides are limited to 1000 at 5' and 3' ends of the exon. If you type a value superior to 1000, the system will automatically change it to 1000. Default value: 0.

Advanced parameters

These parameters only work for mutants and multiple transcript analysis.

  • Process sequences: To save computation time, HSF will only process the region where the mutation is located. To view all elements on the whole sequence, please use the "Full sequence" option. Default: Only variants.
  • Add images: To speed up calculation, images are disabled by default. You can enable them with this option. Default: No.

Matrices and thresholds

You can use all or a selection of matrices listed in this field. Furthermore, you can adjust threshold values by entering the desirated values in the appropriate boxes. Thresholds range from 0 to 100 in all matrices used in HSF. Available matrices are listed in the "Select matrices" section.