
Closed
Posted
Paid on delivery
My legal-research assistant relies on a “Research” module that has stopped returning results. I need two things done together so the tool is fully functional again:

1) Build (or refine) a scraper that reliably pulls pure text data from the official Supreme Court of India website as well as each High Court site. The crawler must respect [login to view URL], handle pagination, and normalise judgments, orders and daily cause-lists into a clean structure before inserting them straight into our existing database (PostgreSQL). If you prefer Python, feel free to combine requests, BeautifulSoup, Selenium or Playwright, whatever keeps it stable and headless-friendly.

2) Trace and fix the broken “Research” feature inside the AI tool. It currently fails when it tries to query the new tables; the bug appears in the indexing layer, not the LLM itself. Once the scraper has seeded fresh data, the search endpoint should return relevant citations in under two seconds.

I’ll provide full repo access, schema diagrams and sample failing calls.

Acceptance criteria
• Automated scraper runs on a cron, logs activity and retries gracefully
• All fetched records are de-duplicated and visible in the database dashboard
• Research endpoint passes provided unit tests and returns accurate matches
• Code is documented well enough for hand-off and future scaling

If you have prior experience with court or government sites, that will speed things up, but solid web-scraping and API debugging skills are what matter most. Let’s get this tool searching again.
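As a rough illustration of the "normalise, then de-duplicate" requirement, the sketch below maps scraped dictionaries onto one record shape and derives a stable key from the identifying fields. The `CourtRecord` fields, the helper names and the sample case number are all assumptions made for illustration; the real structure would follow the schema diagrams the client mentions.

```python
import hashlib
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class CourtRecord:
    # Hypothetical fields; the real schema comes from the client's diagrams.
    court: str
    doc_type: str      # "judgment", "order" or "cause_list"
    case_number: str
    decided_on: str    # ISO date string, e.g. "2024-01-15"
    text: str


def clean(value: str) -> str:
    """Collapse runs of whitespace and trim both ends."""
    return re.sub(r"\s+", " ", value).strip()


def normalise(raw: dict) -> CourtRecord:
    """Map one scraped dict onto the common record structure."""
    return CourtRecord(
        court=clean(raw["court"]),
        doc_type=clean(raw["doc_type"]).lower(),
        case_number=clean(raw["case_number"]).upper(),
        decided_on=clean(raw["decided_on"]),
        text=clean(raw["text"]),
    )


def dedup_key(record: CourtRecord) -> str:
    """Stable hash of the identifying fields, usable as a unique key."""
    basis = "|".join(
        [record.court.lower(), record.doc_type, record.case_number, record.decided_on]
    )
    return hashlib.sha256(basis.encode("utf-8")).hexdigest()


def deduplicate(records: list[CourtRecord]) -> list[CourtRecord]:
    """Keep the first record seen for each key, preserving input order."""
    seen: set[str] = set()
    unique = []
    for record in records:
        key = dedup_key(record)
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique
```

One possible design, not confirmed by the brief, is to store `dedup_key` in a column with a UNIQUE constraint and insert with PostgreSQL's `INSERT ... ON CONFLICT DO NOTHING`, so that repeated cron runs cannot create duplicate rows.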
Project ID: 40335825
26 proposals
Remote project
Active 10 days ago
26 freelancers are bidding an average of ₹9,119 INR for this project

I have over 18 years of experience in data mining, web scraping, scraping bots, and Chrome/Opera extensions; I have done it all. Tell us your source and we will put it in Excel for you, or give you filtered results as per your requirements, in the format you want: Excel, JSON, MySQL, other databases, XML, you name it. We can also help integrate the data with your databases and create JSON outputs. We are good not only at scraping but also with the tools you may need afterwards, and we can help you build software around the data. We offer 99% data accuracy and have a duplicate finder. We can also help with statistics on the data, create APIs in front of the data, create software to manage that data, and build sites around the data.
₹7,000 INR in 2 days
6.9

Interesting project, I will build the scraper for the Supreme Court and High Court sites using Python with Playwright for headless rendering, pulling judgments, orders, and cause-lists into your PostgreSQL database with de-duplication and clean normalization. It will run on cron with logging and graceful retries. Then I will trace and fix the broken Research module — the indexing layer bug — so the search endpoint returns accurate citations under two seconds once fresh data is seeded. For court sites specifically, I will build the scraper with per-domain rate limiting and session rotation because government sites often block IPs after rapid sequential requests without returning an explicit error — they just serve empty pages. Catching that silently broken state early prevents gaps in your dataset that only surface later when a citation is missing. Questions: 1) How many High Court sites need to be covered — all 25, or a specific subset to start? 2) For the Research bug, is the indexing layer using full-text search (tsvector) or a vector database for semantic search? Looking forward to talking through the details. Kamran
₹15,500 INR in 7 days
6.7
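The per-domain rate limiting and the check for silently served empty pages described in the bid above could be sketched roughly as follows. This is only an illustration: `DomainRateLimiter`, `looks_empty` and the 200-character threshold are hypothetical names and values, not taken from the bidder's code.

```python
import re
import time
from urllib.parse import urlparse


class DomainRateLimiter:
    """Enforce a minimum gap between requests to the same host."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock      # injectable so the limiter can be tested
        self._sleep = sleep
        self._last = {}          # host -> time of the previous request

    def wait(self, url):
        """Block until the host may be hit again; return seconds waited."""
        host = urlparse(url).netloc
        waited = 0.0
        last = self._last.get(host)
        if last is not None:
            remaining = self.min_interval - (self._clock() - last)
            if remaining > 0:
                self._sleep(remaining)
                waited = remaining
        self._last[host] = self._clock()
        return waited


def looks_empty(html, min_chars=200):
    """Heuristic: treat a page with very little visible text as blocked."""
    text = re.sub(r"<[^>]+>", " ", html)   # crude tag stripping
    return len(" ".join(text.split())) < min_chars
```

A scraper loop would call `limiter.wait(url)` before each request and log or re-queue any response for which `looks_empty` returns true, so that gaps are noticed at fetch time rather than when a citation turns out to be missing.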

I can rebuild a stable scraper for Supreme Court & High Courts (Python + Playwright/BS4) with clean normalization into PostgreSQL, plus fix your indexing bug so the Research module returns fast, accurate results. Will include cron automation, deduplication, logging, and optimized search endpoint (<2s) with proper indexing.
₹1,500 INR in 1 day
5.4

I already have most of the scraper ready. I developed it using .NET Core 8, so it can easily be customised to your needs and published as a cron job on a Linux VPS. Moreover, I can develop a RAG LLM pipeline for all orders in PDF, so you can query directly in natural English or Hindi to get well-organized answers, suggestions, order references, etc. Drop me a message and let's have a quick conversation to finish it. With regards, Maroof K.
₹25,000 INR in 7 days
4.4

⭐ Hello there, My availability is immediate.

I read your project post on Python Developer for Court Data Scraper & AI Repair.

We are experienced full-stack Python developers with skill sets in:
- Python, Django, Flask, FastAPI, Jupyter Notebook, Selenium, Data Visualization, ETL
- AI/ML & Data Science: Model development, training & deployment, NLP, Computer Vision, Predictive Analytics, Deep Learning
- React, JavaScript, jQuery, TypeScript, NextJS, React Native
- NodeJS, ExpressJS
- Web App Development, Web/API Scraping
- API Development, Authentication, Authorization
- SQLAlchemy, PostgresDB, MySQL, SQLite, SQLServer, Datasets
- Web hosting, Docker, Azure, AWS, GCP, Digital Ocean, GoDaddy, Web Hosting
- Python Libraries: NumPy, pandas, scikit-learn, TensorFlow, PyTorch, etc.

Please send a message so we can quickly discuss your project and proceed further. I am looking forward to hearing from you. Thanks
₹11,590 INR in 3 days
4.2

As a seasoned full-stack developer with over 7 years of experience, I have meticulously crafted end-to-end solutions for complex projects like yours. My extensive knowledge of AI development and database management, coupled with my proficiency in Python and PostgreSQL, makes me the prime candidate to handle your Court Data Scraper & AI Repair project. Throughout my career, I've successfully created web applications just like the one described, ensuring they are not only efficient and scalable but also capable of handling large amounts of real-time data. I'm adept at using web scraping tools such as BeautifulSoup and the requests library in Python to gather substantial amounts of pure text data while respecting robots.txt. Moreover, I have previously worked on projects involving court and government websites, which gives me an added advantage in navigating the complexity of these sites. As your potential long-term collaborator, I would communicate openly every day and provide realistic timelines to ensure complete transparency. My previous clients can vouch for my meticulous documentation, which ensures they always receive clean code that is easy to understand without any guesswork. Let's build something impactful together!
₹12,000 INR in 7 days
2.7
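For reference, respecting robots.txt as the bid above mentions can be checked with Python's standard `urllib.robotparser`. The sketch below parses an inline sample file so it runs offline; the rules, the `research-bot` user agent and the `example.org` URLs are invented for illustration and do not describe any real court site.

```python
from urllib.robotparser import RobotFileParser

# Made-up robots.txt content, used only to demonstrate the API.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
Crawl-delay: 5
"""


def build_parser(robots_text: str) -> RobotFileParser:
    """Build a parser from robots.txt text that was already downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser


def allowed(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the rules permit this agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)


parser = build_parser(SAMPLE_ROBOTS)
print(allowed(parser, "research-bot", "https://example.org/judgments/1"))
print(allowed(parser, "research-bot", "https://example.org/private/x"))
print(parser.crawl_delay("research-bot"))
```

In a live crawler the parser would more likely be filled with `set_url(...)` followed by `read()`, and any `Crawl-delay` value could feed the scraper's own request pacing.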

Your AI legal assistant's 'Research' module is failing due to broken data ingestion and indexing—a two-part fix requiring a stable court-data scraper and precise backend debugging. In my Energy Label Parser project, I built an AI system to extract structured data from complex documents, directly applicable to parsing judgments and cause-lists. My Python, PostgreSQL, and web scraping skills match your stack, backed by my Applied Data Science with Python certification. I'd tackle this in two clear phases: first, building the scraper and database pipeline, then fixing the search endpoint. Can you share the schema for the new tables the 'Research' feature is failing to query, so I can target the indexing bug immediately?
₹12,500 INR in 7 days
0.0

I can quickly repair your legal-research assistant's 'Research' module and set up a robust court data scraper. My expertise in Python and AI will ensure accurate and timely results. Fast delivery guaranteed.
₹5,000 INR in 2 days
0.0

Hi, I can fix your broken Research module and build a stable, legal-website scraper for you.

I have strong experience in:
- Web scraping with pagination & error handling
- PostgreSQL database, indexing, and query optimization
- Backend API debugging and fixing search performance
- Cron jobs, logging, and reliable data pipelines

I can make your endpoint return results within 2 seconds, cleanly insert structured legal data, and ensure deduplication. I can start immediately and deliver well-documented, maintainable code.
₹7,000 INR in 7 days
0.0

Hi, I can fix your Research module by building a stable Playwright/BeautifulSoup scraper for Court data. I am an expert in PostgreSQL indexing and API debugging, ensuring your search returns results in under 2 seconds. I will provide a cron-automated, de-duplicated data pipeline with full documentation.
₹7,000 INR in 7 days
0.0

I can fix your 'Research' module by building a robust Python scraper (Playwright/BS4) with automated normalization for PostgreSQL. I’ll debug your indexing layer to ensure sub-2-second query speeds. With experience in legal data scraping, I'll ensure stable, cron-ready performance. Ready to start immediately.
₹1,500 INR in 2 days
0.0

Hi, I have experience working with backend systems and fixing API-related issues. I recently built and deployed a FastAPI-based project and can help debug and fix your issue efficiently. I can start immediately and ensure a clean and reliable solution.
₹4,000 INR in 7 days
0.0

Your Research module failure sounds like an indexing/query mismatch against the new tables. I've debugged similar issues where the search layer breaks after schema changes in PostgreSQL. For the scraper, I'll use Python + Playwright (headless) for the JS-heavy court sites and BeautifulSoup for static pages, with proper pagination handling, de-duplication via unique case IDs, and cron scheduling with retry logic. I can start immediately, and I have experience scraping government portals that use inconsistent HTML structures. Happy to do a quick diagnostic on the failing Research endpoint first to confirm the root cause. Could you share one of the sample failing calls so I can pinpoint the exact indexing issue?
₹3,000 INR in 3 days
0.0
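The retry logic mentioned in the bid above, together with the brief's "logs activity and retries gracefully" criterion, might look something like this minimal sketch. `fetch_with_retries`, its defaults and the `scraper` logger name are assumptions; `fetch` stands for whatever callable actually downloads a page.

```python
import logging
import time

log = logging.getLogger("scraper")


def fetch_with_retries(fetch, url, attempts=4, base_delay=1.0, sleep=time.sleep):
    """Call fetch(url), retrying with exponential backoff on any exception."""
    for attempt in range(1, attempts + 1):
        try:
            return fetch(url)
        except Exception as exc:
            if attempt == attempts:
                # Out of attempts: log and let the caller see the error.
                log.error("giving up on %s after %d attempts", url, attempts)
                raise
            delay = base_delay * 2 ** (attempt - 1)   # 1s, 2s, 4s, ...
            log.warning(
                "attempt %d for %s failed (%s); retrying in %.1fs",
                attempt, url, exc, delay,
            )
            sleep(delay)
```

Catching every `Exception` keeps the sketch short; a real scraper would probably retry only on network and timeout errors and fail fast on parsing bugs.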

This is right up my alley - I've built similar data pipelines combining web scraping with AI search. I'd use Python with Playwright for the court sites (they tend to be JS-heavy) and BeautifulSoup for parsing judgments into clean structured data for PostgreSQL. For the broken Research module, I'll trace the indexing layer issue - likely a schema mismatch or missing index after the data model changed. Built Seekret's full monitoring and data pipeline stack (acquired by Datadog), so debugging search endpoints and fixing data flows is familiar territory. Can start right away.
₹8,000 INR in 5 days
0.0

New Delhi, India
Joined Nov 3, 2020