Application Guide

Welcome to the user guide for Arabidopsis Gene Network Explorer (AGNE). This document will help you understand how to use the application, the purpose of each feature, and how to interpret the results generated.

Overview

The AGNE application provides a platform for analyzing gene expression data, performing network visualization, and conducting functional enrichment analysis in Arabidopsis thaliana.

Key Features: - Upload and preprocess gene expression data. - Generate volcano plots for differential expression analysis. - Visualize gene interaction networks. - Perform GO and KEGG enrichment analyses. - Export results and plots for reporting.

Getting Started

1. Upload Your Data - The application accepts CSV files for the following:

  • Normalized Data: Contains gene expression levels.

  • Differentially Expressed Genes (DEGs): Contains log fold change (logFC) and adjusted p-values for genes.

  • Genes of Interest: An optional list of specific genes you want to focus on.

Data Format Requirements: - Normalized Data: Gene IDs as rows and sample names as columns. - DEGs: A CSV file with the following columns:

  • Gene_ID

  • logFC

  • p.adjust (adjusted p-value)

  • Genes of Interest: A list of `Gene_ID`s (one per line).

2. Set Analysis Parameters - Use the sidebar to define analysis thresholds:

  • logFC Threshold: Specify the minimum fold change for differential expression.

  • p-value Threshold: Set the significance level for adjusted p-values.

  • Regulation Type: Choose between Upregulated, Downregulated, or Both.

3. Run Analysis - Click the “Run Analysis” button to process the uploaded data and generate results.

Features and Outputs

### Volcano Plot The Volcano Plot visualizes the relationship between log fold change and significance (adjusted p-value) for all genes: - X-axis: log2(Fold Change) - Y-axis: -log10(Adjusted p-value) - Color Coding:

  • Red: Overexpressed genes.

  • Blue: Underexpressed genes.

  • Grey: Non-significant genes.

  • Hover over points to see gene-specific details.

How to Use: - Select genes of interest by dragging over the plot. - Selected genes will appear in the table below the plot.

### Gene Interaction Network The Network Visualization explores relationships between genes based on co-expression or known interactions: - Nodes represent genes. - Edges represent interactions.

How to Customize: - Adjust network layout, node size, and edge thickness. - Highlight genes of interest by uploading a list of `Gene_ID`s.

### GO Enrichment Analysis The Gene Ontology (GO) Analysis identifies overrepresented biological functions in your gene set: - Categories:

  • Biological Process (BP)

  • Molecular Function (MF)

  • Cellular Component (CC)

  • Output: - Barplot of top GO terms. - Table of enriched terms with p-values and associated genes.

How to Interpret: - Higher bar height indicates stronger enrichment. - Click GO terms to open detailed descriptions on external websites.

### KEGG Enrichment Analysis The KEGG Analysis identifies pathways enriched in your gene set: - Output:

  • Barplot of top KEGG pathways.

  • Table with pathway details and associated genes.

How to Interpret: - Larger Rich Factor values indicate stronger pathway involvement. - Click KEGG IDs to explore pathways interactively.

### Exporting Results You can export: - Plots: Download as PNG or PDF. - Tables: Save as CSV or Excel files.

Tips for Effective Use

  • Ensure your data files are correctly formatted before uploading.

  • Start with broader thresholds (e.g., lower logFC and higher p-value) to get an overview, then refine your filters.

  • Use the interactive plots to explore and validate your hypotheses.

Understanding Results

  • Differentially expressed genes help identify key players in the conditions being studied.

  • Gene Ontology and KEGG pathways provide biological context to your results.

  • Network visualization reveals relationships and central nodes within your data.

FAQs

1. What if my data file doesn’t upload? - Check that the file format is CSV and meets the required column structure.

2. Why does my Volcano Plot show no significant genes? - Ensure your thresholds (logFC and p-value) are not too strict.

3. Can I analyze non-Arabidopsis species? - The application is tailored to Arabidopsis thaliana. For other species, ensure appropriate databases are available.

We hope this guide helps you make the most of the AGNE application. For further assistance, please contact the development team or refer to the full documentation.