SMART GST: AI Powered Circular Summarizer
Author : CA Mary Brinda John Wilson
The AI-Powered GST Circular Summarizer is a web application built using Streamlit.
It enables users—primarily Chartered Accountants, Tax Consultants, and Compliance Professionals—to scrape, summarize, translate, and analyze GST (Goods and Services Tax) Circulars issued by the Government of India.
This tool provides:
The app thus automates the tedious task of interpreting circulars manually and helps professionals stay compliant efficiently.
User Action:
Browse all GST Circulars automatically scraped from the GST Council Website.
Purpose:
View circular metadata like Circular Number, Date, Subject, File Size, and Link.
Additional Features:
Filter by Date Range, Year, or Month.
Keyword-based Search.
Smart Tagging of Circulars (e.g., "ITC", "Export", "Refund").
User Action:
Select a Circular ➔ Download the PDF ➔ Extract and Summarize contents.
Purpose:
Generate a structured executive summary automatically for ease of understanding.
Sections Summarized:
Technology:
OpenAI’s GPT-3.5 Turbo API
User Action:
Upload a custom PDF (GST Circular, Notification, Order).
Purpose:
Summarize any GST-related document, even if not scraped.
Benefit:
Extend use beyond only the scraped government circulars
User Action:
Translate any generated summary into 22 Indian Languages (Hindi, Tamil, Marathi, Bengali, etc.).
Purpose:
Make GST information accessible to a wider audience.
Format:
HTML-based translation download for non-English languages.
Feature | Description |
Web Scraper | Scrapes all GST Circulars using BeautifulSoup. |
PDF Downloader | Downloads circulars safely to local storage. |
PDF Text Extraction | Extracts readable text using pdfplumber. |
Summarizer | Summarizes extracted text via OpenAI GPT model. |
Translator | Translates text into selected Indian languages. |
Download Options | Allows users to download Summary PDFs and HTML. |
Visualization Dashboard | Displays trends using Matplotlib and WordCloud. |
Component | Technology |
Frontend & Backend | Streamlit |
Web Scraping | BeautifulSoup, Requests |
Text Extraction | pdfplumber |
Summarization & Translation | OpenAI GPT-3.5 Turbo |
Visualization | Matplotlib, WordCloud |
Document Conversion | WeasyPrint (for PDF generation) |
Dependent on GST Council Website Structure:
If website structure changes, scraping module may require updates.
Paid OpenAI API Key:
Requires a valid paid API key for full functionality.
https://www.youtube.com/watch?v=oQ40iTUFmSg