Executive Summary
A fully automated AI-driven accounting engine that converts raw excel bank statements into:
✔ Clean and Structured ledgers and auto-classified Income & Expense Heads and creation of new ledgers.
✔ 44AD Computation Sheet (Comparison between Old and New Regime) in single view.
✔ Auto-prepared ITR-4 figures.
✔ Exportable Trading and Profit Loss Account and Balance Sheet PDFs and Excel Files.
The system removes 90% of the manual effort traditionally required for presumptive taxation cases under Section 44AD by automating bank narration interpretation, ledger grouping and tax computation.
Built using Python, AI models, Streamlit and Excel automation, it provides a smart, accurate and scalable DIY tax engine for Chartered Accountants.
Industry Context and Background
A majority of India’s MSME taxpayers file ITR-4 (44AD Presumptive Taxation). Out of 7.28 Cr ITR Filed in AY 2024-25, 1.88 CR ITR were filed under Presumptive Taxation Scheme (25.77% of Total ITR Filed for the year)
Despite its simplicity, CAs still spend significant time on:
- Cleaning Raw Bank Statements and understanding Cryptic Narration Codes different for every bank.
- Deriving Turnover from Bank Flows.
- Preparing Final Accounts for compliance purposes.
- Computing Presumptive Income Manually for Computation of Tax.
BUT no enterprise-grade tool exists that can:
🔹 Parse Bank statements in excel format of any bank (ICICI, BOB, HDFC, Axis, Kotak, SBI etc.) to creation of ledgers, mapping of ledgers and compute tax in a single go from start to end.
🔹 Generate financials instantly in single window for view of calculation of Net Profit and JV Adjustments.
🔹 Affordable for small CA firms without using two three different software’s and lot of manual efforts.
Most CAs still rely on Excel + manual review, which is slow, time consuming, error-prone and not scalable.
PART A - Problem Statement
“Bank statements contain the complete financial story — but extracting accounting meaning from them requires hours of manual work for computation of tax”
Key challenges:
- No standard format across banks as narrations are messy and cryptic (UPI/IMPS/ACH/ECOM codes).
- Personal vs business segregation is manual and every bank uses their own formats of Narrations.
- Tally import requires clean ledgers and Ledgers must be created from scratch and manual accounting wastes time.
- Turnover for 44AD must be reconstructed with manual punching of figures.
- Compliance deadlines are strict and data to be processed needs a lot of time against the revenue generated for filing presumptive returns.
This leads to:
- Hours wasted per client
- Errors in turnover classification (especially UPI/IMPS credits)
- Inconsistent accounting treatment
- Difficulty scaling during peak season
Why Existing Solutions Fail
❌ Generic bank statement converters
Do NOT understand Indian taxation requirements, GST or CA-specific requirements.
❌ Tally import tools
Only map data—they do not compute tax or generate ITR-4 numbers. For that we need another software with manual plug in of figures.
❌ Not suitable for confidential financial data
Many solutions are cloud-based → privacy concerns for CA practice.
PART B - AI-Powered Solution Overview
A completely automated pipeline:
Bank Statement → Structured Dataset
Reads Excel/ CSV from all major banks and cleans date formats, debit/credit detection, and remarks.
AI Narration Classifier
Understands Indian narration patterns using rule-based + ML hybrid model:
UPI ID patterns, IMPS structure, POS/ECOM codes, Loan/EMI, Cash deposits/withdrawals
Ledger Mapping Engine
Auto-suggests ledger for each transaction based on predefined rules and option to change the mapping to the ledgers and helps in creation of new ledgers
Automated Final Accounts Generator
Creates:
- Turnover based on bank transactions taking reference of Opening Trial Balance figures and bank transactions.
- Calculation of Presumptive Income and Profit under 44AD (@ 6% / 8%).
- Passing of Expense and Income Entries using JV Module and creating new ledgers.
- Capital account movement of Home Loan, Interest Income, Capital Gains and other items of income.
- Sundry balances of Final Accounts to be reported in ITR-4 and Cash/bank reconciliations.
-
-
ITR-4 Ready Output
Generates:
- Profit u/s 44AD taking into account the Opening Balances, Bank Transactions and JV Entries.
- Gross receipts and Balance Sheet Figures (ITR-4 Schedule BP).
- Comparison of Tax as per Old Regime and New Regime taking into account the Finance Act of the relevant Assessment Years.
Output Formats
- Excel and PDF downloadable Final Accounts
-
Why This Tool is Different
- Built by a CA for CAs
- Uses Indian narration intelligence
- Fully offline (privacy safe)
- Zero recurring cost
- Customizable for any client/bank
- AI + Rule-based hybrid → Highest accuracy
- Includes 44AD Tax Computation + BS + P&L
- First-of-its-kind: “Bank → Tax” pipeline in India
PART C -Technologies Used
Programming & Libraries
- Python (Vibe Coding using Cursor.AI)
- pandas
- numpy
- openpyxl
- reportlab (PDF generation)
- regex & rule-based narration parser (based on patterns identified from Chat GPT)
- Streamlit (Interactive UI)
-
-
AI Layer
- Prompt-engineered models (Chat GPT) for narration understanding
- Reinforcement Learning via user-input feedback
- Streamlit (Interactive UI)
-
-
Data Inputs
- Opening Trial Balance for the year
- Bank statements in excel and CSV formats (ICICI, BOB, Axis, Kotak, SBI, HDFC etc.)
- JSON Files for Loading the Data and work last saved in case of data getting refreshed
- Ledger master (List of Generic Master Ledgers normally used in predefined classification for accounts)
- 44AD rule engine (YAML File with hardcoded rules for Tax Calculation)
Implementation Workflow
Step 1 – Upload Bank Statement and Opening Trial Balance
Handles:
- Excel and CSV supported files
Automatically standardizes formats.
Step 2 – Narration Intelligence Engine
Classifies entries into:
- Cash deposits
- Capital account
- Withdrawals
- Bank charges
- EMI/Loan
- UPI/POS/IMPS
Step 3 – Ledger Suggestions with Instant Update
- Per-row “Map Ledger” button
- Updates instantly (no double clicks)
- Saves mapping permanently
- Retains user corrections
Step 4 – Final Output Generator
Produces:
- Profit & Loss A/c and Balance Sheet (Preview on screen – for changes before PDF generation).
- 44AD Computation as per New Regime and Old Regime and suggestion for selection of regime.
- Detailed Computation Sheet with Computation of Gross Total Income, Net Income and Tax Break - Up.
- PDF and Excel Export (Final Accounts)
Outcomes & Benefits
Quantitative Benefits
- 80–90% reduction in time spent per client
- 60% fewer errors in turnover recognition
- 5x faster generation of P&L and Balance Sheet
- 100% consistent classification logic
Qualitative Benefits
- Better client insights
- CA can handle more clients in less time
- Improves compliance quality
- Enhances digital practice model
Summary & Conclusion
This use case demonstrates how AI + Automation can transform the simplest yet most time-consuming compliance workload for CAs.
What earlier required hours of:
✔ Narration reading
✔ Classification
✔ Ledger mapping
✔ 44AD turnover computation
✔ Preparing financial statements
…now happens in minutes, with accuracy, speed and privacy.
“From Bank Statement to ITR-4 — Automated, Efficient & CA-Friendly.”
