
Closed
Posted
Paid on delivery
I want to create a tool that takes a PDF containing a set of questions in math, chemistry, or physics. An AI should capture (screenshot) each question separately, store it by question type (such as MCQ), and also store it by topic and subtopic. The tool must handle edge cases correctly: a question that spans multiple pages, or one that includes images, diagrams, or graphs (which may themselves run over more than one page), should still come out as a single unit with all of its parts attached. The output can be screenshots or another format, as long as each separated question is easy to access and download and is stored under the right topic in the chemistry, physics, or math syllabus. I will provide all the syllabus topics and the materials with the questions; I need an algorithm or model that can do the rest.
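One way to picture the "single unit" requirement is a record that holds every page region belonging to a question alongside its type and syllabus labels. The sketch below is only an illustration under assumptions: the class names `Region` and `Question`, the field names, and the sample values are hypothetical and do not come from the project brief.

```python
import json
from dataclasses import dataclass, field, asdict
from typing import List, Optional


@dataclass
class Region:
    """One rectangular crop on one page (coordinates in PDF points)."""
    page: int
    x0: float
    y0: float
    x1: float
    y1: float
    kind: str = "text"  # hypothetical labels: "text", "image", "graph"


@dataclass
class Question:
    """A single question kept together with all of its parts."""
    question_id: str
    subject: str                      # "math", "chemistry", or "physics"
    is_mcq: bool
    topic: Optional[str] = None
    subtopic: Optional[str] = None
    regions: List[Region] = field(default_factory=list)

    def pages(self) -> List[int]:
        """Sorted list of the pages this question touches."""
        return sorted({r.page for r in self.regions})

    def to_json(self) -> str:
        """Serialize the whole unit, regions included, for download."""
        return json.dumps(asdict(self), indent=2)


# Made-up example: a question whose graph continues on the next page.
q = Question("q7", "physics", True, "Kinematics", "Projectile motion")
q.regions.append(Region(page=3, x0=50, y0=600, x1=550, y1=780))
q.regions.append(Region(page=4, x0=50, y0=40, x1=550, y1=300, kind="graph"))
print(q.pages())
```

Keeping the regions as a list is what lets a multi-page question stay one record; each region can later be cropped to an image and stored next to the JSON.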
Project ID: 40338560
39 proposals
Remote project
Active 14 days ago
39 freelancers are bidding an average of $166 AUD for this project

Hi there, I’ve carefully reviewed your project and understand you need a system that can process PDFs of mixed subject questions, intelligently separate each question (including multi-page, image, and graph-based ones), and organize them by type, topic, and subtopic for easy access and download. My approach begins with building a pipeline that combines OCR and layout-aware parsing to detect question boundaries, even across multiple pages. I will use computer vision techniques to identify text blocks, diagrams, and figures, ensuring each question is captured as a complete unit, including all related visual elements. Next, I will implement logic to classify questions into MCQ or other formats and use NLP models to map each question to the correct subject, topic, and subtopic based on your syllabus. Edge cases like split questions, embedded images, or graphs spanning pages will be handled through context linking and positional tracking. The output will be structured so each question is stored either as a clean image (screenshot-style) or structured data, with downloadable formats and clear organization for filtering and reuse. Finally, I will deliver a scalable solution with clear setup guidance so you can process new PDFs easily and extend the system over time. How would you prefer the final output: primarily as images per question, structured JSON, or both for flexibility? Warm regards, Aneesa.
$100 AUD in 1 day
6.3
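The question-boundary detection mentioned in the bid above could, in its simplest text-only form, look something like the following. This is a rough sketch, not the bidder's method: it assumes the text lines have already been extracted per page, and the numbering pattern `QUESTION_START` and the function name `split_questions` are hypothetical. Real papers with numbered sub-parts or numbered options would need stricter rules.

```python
import re
from typing import Dict, List

# Assumed numbering styles: "1.", "12)", "Q3.", "Question 4:"
QUESTION_START = re.compile(
    r"^\s*(?:Q(?:uestion)?\s*)?(\d+)\s*[.):]\s+\S", re.IGNORECASE
)


def split_questions(pages: List[List[str]]) -> List[Dict]:
    """Group text lines into questions. Lines that appear before the next
    start marker, even on a later page, stay with the current question."""
    questions: List[Dict] = []
    current = None
    for page_no, lines in enumerate(pages, start=1):
        for line in lines:
            match = QUESTION_START.match(line)
            if match:
                current = {
                    "number": int(match.group(1)),
                    "pages": [page_no],
                    "lines": [line.strip()],
                }
                questions.append(current)
            elif current is not None and line.strip():
                current["lines"].append(line.strip())
                if page_no not in current["pages"]:
                    current["pages"].append(page_no)
    return questions


# Made-up input: question 2 starts on page 1 and continues on page 2.
pages = [
    ["1. What is 2 + 2?", "(A) 3", "(B) 4",
     "2. A ball is thrown upward at 2.5 m/s.", "Find the time taken to reach"],
    ["the maximum height.", "(A) 0.25 s", "(B) 0.5 s", "3. Name the gas."],
]
result = split_questions(pages)
```

Because a question only ends when the next numbered start is seen, page breaks are ignored by construction, which is the behaviour the brief asks for.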

As a seasoned Full-Stack Developer with a demonstrated track record in building successful AI systems, I bring a unique blend of skills to this project. Having tackled tasks such as object detection, image processing, and text data classification in my career, I understand the challenges of organizing complex information like yours. I'll leverage this experience to design an intelligent tool that handles questions in varied formats and organizes them based on your syllabus topics. My proficiency in Computer Vision and Machine Learning (ML) will ensure that even when questions span multiple pages or include images, diagrams, or graphs, they are accurately separated as single units. An algorithm that efficiently processes MCQs from PDFs can be built by combining textual and visual feature extraction within ML. In other words, you'll have a dependable system that stores each question while categorizing it by topic and subtopic within math, chemistry, and physics. Beyond employing an algorithm or model, I am committed to providing hands-on support throughout the development process and ensuring your specific needs are met promptly and precisely.
$140 AUD in 3 days
5.6

Hello, I hope you're doing well. I understand you're looking for an AI-Powered Question Segregation Tool, and I am the ideal candidate for your project. I have read the job description and understand what you are looking for. I have over 10 years of experience in Engineering, Chemical Engineering, Machine Learning (ML), Mathematics, Physics, Data Science, Computer Vision, and Natural Language Processing. Please feel free to discuss the requirements and timeline for the project further. I'd be happy to assist you, and I am ready to start right now. ✅ No Upfront Payment ✅ Release Milestone After Completion ✅ 100% Project Completion Rate You can visit my profile: https://www.freelancer.com/u/HiraMahmood4072 Thank you
$100 AUD in 2 days
4.3

⭐⭐⭐⭐⭐ Create a PDF Tool for Extracting Questions in Math, Chemistry, and Physics ❇️ Hi My Friend, I hope you're doing well. I've reviewed your project requirements and see you are looking for a tool to extract questions from PDFs. Look no further; Zohaib is here to help you! My team has completed 50+ similar projects for educational tools. I will develop an efficient algorithm that captures each question, including edge cases like images and multi-page questions, ensuring everything is stored correctly by topic. ➡️ Why Me? I can easily create your PDF tool as I have 5 years of experience in developing educational software, specializing in data extraction and image processing. My expertise includes Python programming, AI models, and working with various file formats. I also have a strong grip on OCR technology and data organization, ensuring a seamless solution for your needs. ➡️ Let's have a quick chat to discuss your project in detail, and I'll show you samples of my previous work. I look forward to discussing this with you in our chat. ➡️ Skills & Experience: ✅ Python Programming ✅ AI Model Development ✅ Data Extraction ✅ Image Processing ✅ Optical Character Recognition (OCR) ✅ Multi-page Handling ✅ Data Storage Solutions ✅ Topic Classification ✅ PDF Handling ✅ Algorithm Design ✅ User Interface Design ✅ Quality Assurance Waiting for your response! Best Regards, Zohaib
$150 AUD in 2 days
5.0

Hi there, I understand you need a system that can intelligently parse PDFs containing mixed-format questions (MCQs, diagrams, multi-page content) and extract each question as a complete, structured unit while classifying it by subject, topic, and subtopic. I have experience building document AI pipelines combining computer vision and NLP, and I can design a solution that segments questions accurately, handles edge cases like multi-page questions or embedded graphs, and preserves all associated elements (text, images, diagrams) as a single entity. My approach would involve a hybrid pipeline using PDF parsing, layout detection (for bounding boxes and question segmentation), and NLP models for classification into your provided syllabus structure. I’ll ensure each question can be exported as a screenshot or structured format (JSON + assets), making it easy to store, search, and download. Special attention will be given to edge cases such as split questions across pages and image-heavy problems to maintain completeness and accuracy. The final system will be modular, allowing you to process new PDFs easily, with clear documentation and reproducible workflows. My focus is to deliver a reliable tool that not only extracts questions but organizes them in a way that is immediately usable for study, analysis, or content management. Regards, Ahmad
$100 AUD in 7 days
4.1

Hey, I’ve reviewed your project and understand you’re looking for a tool that ingests PDFs containing math, physics, or chemistry questions, identifies each question (including multi-page or image/graph-based ones), and stores them individually by topic and subtopic. The tool should be able to handle MCQs, diagrams, graphs, and other embedded media, preserving each question as a single unit, and output them in a format that’s easy to download or access. I can develop a Python-based solution using PDF parsing libraries (like PyMuPDF or PDFPlumber) combined with AI vision and NLP models to detect question boundaries, capture any associated images or graphs, and classify each question according to the provided syllabus. Each question can be exported as a screenshot or structured format (PDF snippet, image, or JSON), and stored in a topic-wise hierarchy for easy retrieval. Edge cases like multi-page questions or embedded diagrams will be handled through AI-driven segmentation and content aggregation. You’ll receive a ready-to-run tool that can process batches of PDFs, automatically separate, classify, and store questions in a structured, downloadable way, along with clear documentation so you can feed new PDFs into the pipeline confidently. Best regards, Muhammad Adil Portfolio: https://www.freelancer.com/u/webmasters486
$160 AUD in 4 days
3.4
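The topic-wise storage hierarchy described in the bid above might be laid out on disk roughly as follows. This is only a sketch under assumptions: the folder scheme (subject/topic/subtopic/question id), the helper names `slugify`, `question_dir`, and `save_question`, and the sample record are all hypothetical.

```python
import json
import re
import tempfile
from pathlib import Path


def slugify(name: str) -> str:
    """Lowercase the name and replace runs of non-alphanumerics with '-'."""
    return re.sub(r"[^a-z0-9]+", "-", name.lower()).strip("-")


def question_dir(root: Path, subject: str, topic: str, subtopic: str, qid: str) -> Path:
    """Build the folder for one question inside the topic hierarchy."""
    return root / slugify(subject) / slugify(topic) / slugify(subtopic) / qid


def save_question(root: Path, record: dict) -> Path:
    """Write the metadata as question.json; image crops would sit beside it."""
    target = question_dir(
        root, record["subject"], record["topic"], record["subtopic"], record["id"]
    )
    target.mkdir(parents=True, exist_ok=True)
    out = target / "question.json"
    out.write_text(json.dumps(record, indent=2), encoding="utf-8")
    return out


# Made-up record written to a temporary directory.
root = Path(tempfile.mkdtemp())
saved = save_question(root, {
    "id": "q012",
    "subject": "Chemistry",
    "topic": "Organic Chemistry",
    "subtopic": "Alkenes & Alkynes",
    "is_mcq": True,
})
print(saved.relative_to(root).as_posix())
```

A plain folder tree like this can be zipped and downloaded as is, and it mirrors the syllabus structure without needing a database.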

Hi there, I understand you need an intelligent system that can process PDFs containing mixed STEM questions, accurately segment each question (including multi-page, image-heavy, and graph-based cases), and organize them by MCQ type, topic, and subtopic. With strong experience in machine learning, computer vision, and NLP, I can build a robust pipeline that reliably extracts, groups, and classifies each question as a complete unit. My approach will combine PDF parsing (PyMuPDF/PDFPlumber) with computer vision techniques to detect question boundaries, even across pages. I will implement logic to merge multi-page questions and preserve associated diagrams/graphs. For classification, I’ll use NLP models (e.g., transformer-based or fine-tuned classifiers) aligned with your syllabus to tag topics and subtopics. Outputs will be stored as structured data (JSON/CSV) along with extracted images/screenshots for each question, ensuring easy access and download. Next, I will handle edge cases like overlapping content, inconsistent formatting, and embedded visuals, ensuring high accuracy and scalability. Deliverables: Full working script/model pipeline, structured outputs (questions + images), and organized topic-based storage. QUESTION: Do your PDFs follow a somewhat consistent format/layout, or should the system be designed to handle highly variable formats from multiple sources? Let’s chat and get started now! Regards, Shehwani.
$75 AUD in 1 day
3.1
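For the MCQ-versus-other split that the bid above mentions, a simple rule-based first pass is possible before any trained classifier is involved. The sketch below is an assumption, not the bidder's implementation: the option pattern `OPTION`, the helper names, and the threshold `min_options` are hypothetical.

```python
import re
from typing import List

# Assumed option styles at the start of a line: "(A)", "A)", "A.", "a)"
OPTION = re.compile(r"^\s*\(?([A-Ea-e])[).]\s+\S")


def option_labels(lines: List[str]) -> List[str]:
    """Collect the option letters found at the start of each line, in order."""
    labels = []
    for line in lines:
        match = OPTION.match(line)
        if match:
            labels.append(match.group(1).upper())
    return labels


def is_mcq(lines: List[str], min_options: int = 3) -> bool:
    """Treat a question as MCQ when its option labels start at A, run
    consecutively, and there are at least min_options of them."""
    labels = option_labels(lines)
    expected = [chr(ord("A") + i) for i in range(len(labels))]
    return len(labels) >= min_options and labels == expected


mcq = ["1. What is 2 + 2?", "(A) 3", "(B) 4", "(C) 5", "(D) 22"]
structured = ["2. Derive the expression for kinetic energy.", "A. Show each step."]
print(is_mcq(mcq), is_mcq(structured))
```

Requiring consecutive labels from A guards against false positives such as a sentence that happens to begin with "E. coli".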

I propose developing a scalable AI-powered system that can intelligently extract, segment, and organize questions from PDF documents containing math, physics, and chemistry content. The solution will accurately identify individual questions, including MCQs, and handle complex scenarios such as multi-page questions, embedded diagrams, and graphs by preserving all related elements as a single structured unit. The system will output clean, structured data (JSON and image formats), making it easy to store, search, and download each question. Additionally, I will implement a topic and subtopic classification module based on your provided syllabus to ensure proper organization of content. To ensure quality and flexibility, I recommend building this in phases, starting with a core extraction system and then enhancing it with advanced features like image association and classification. This approach ensures a reliable, scalable, and production-ready solution tailored to your needs. Best Regards, M. Sajeel
$95 AUD in 3 days
2.5

I’ve recently done very similar work, building PDF question parsers with layout detection and topic classification. Do you want on-device processing or cloud (GPU) for faster layout + OCR pipelines? Should topic tagging use your syllabus rules only, or allow ML classification for edge cases? I suggest using a layout model (LayoutLM/Detectron2) with OCR (Tesseract/PaddleOCR) to group multi-page questions and images as one unit. I also suggest storing outputs as JSON + image slices, which keeps data structured and easy to query or download. I will first build a parser to segment questions using layout + heuristics for page linking. Then I will attach images/graphs and run topic classification using your syllabus mapping. Finally, I will export structured outputs and provide a simple UI/API. Best, Dev S.
$250 AUD in 3 days
2.3
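The "syllabus rules only, or ML for edge cases" question in the bid above can be made concrete with a rules-first pass that declines to answer when the evidence is weak, leaving those questions for a model or a human. This is a minimal sketch under assumptions: the `SYLLABUS` fragment, its keywords, and the `min_hits` threshold are invented for illustration and are not the client's syllabus.

```python
import re
from typing import Dict, List, Optional, Tuple

Label = Tuple[str, str, str]  # (subject, topic, subtopic)

# Hypothetical syllabus fragment mapping each label to trigger keywords.
SYLLABUS: Dict[Label, List[str]] = {
    ("physics", "Kinematics", "Projectile motion"): ["projectile", "trajectory", "launched"],
    ("chemistry", "Acids and bases", "pH"): ["ph", "acid", "base", "titration"],
    ("math", "Calculus", "Differentiation"): ["derivative", "differentiate", "gradient"],
}


def tokenize(text: str) -> List[str]:
    """Split text into lowercase alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


def classify(text: str, min_hits: int = 2) -> Optional[Label]:
    """Return the label with the most keyword hits, or None when the best
    score is below min_hits (the case to hand to an ML fallback)."""
    tokens = set(tokenize(text))
    best, best_hits = None, 0
    for label, keywords in SYLLABUS.items():
        hits = sum(1 for keyword in keywords if keyword in tokens)
        if hits > best_hits:
            best, best_hits = label, hits
    return best if best_hits >= min_hits else None


print(classify("A projectile is launched at 30 degrees. Find its range."))
print(classify("Find the derivative of x^2."))
```

Returning `None` instead of a weak guess keeps the rule-based tags trustworthy and makes the uncertain remainder easy to route elsewhere.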

Hi, that’s great to hear! Your project closely aligns with one I recently worked on. In that project, I built an automated PDF-to-question extraction system using computer vision, NLP, and ML models with structured classification, topic tagging, and multi-page content handling. Your requirement for separating math, chemistry, and physics questions, preserving diagrams and graphs, and classifying them by topic fits closely with the workflow I’ve implemented before. The approach can reliably detect MCQs, structured questions, and edge cases like multi-page diagrams, ensuring each question is exported cleanly and consistently. I’d be glad to connect and share my experience in more detail over chat. Thank you. Best regards, Lazar
$100 AUD in 1 day
0.0

✅✅✅✅✅ Ready To Support You Fully ✅✅✅✅✅
Hi there, I understand you want a system that can intelligently parse PDFs of mixed STEM questions, isolate each question (even across pages), and organize them by type, topic, and subtopic, while preserving diagrams, graphs, and formatting as a single unit. This is a classic CV + NLP pipeline problem, and I’ve worked on similar document-processing systems. My approach would be:
• Use PDF parsing + layout detection (PyMuPDF / PDFPlumber) to extract structured blocks
• Apply computer vision models (e.g., layout-aware detection) to segment individual questions, including multi-page continuity
• Merge related text + images into unified question objects
• Use NLP classification (fine-tuned model or rules + embeddings) to tag MCQ vs structured questions and map them to your syllabus topics
• Export each question as image (screenshot) or structured JSON/PDF snippets, cleanly organized and downloadable
Edge cases like multi-page questions, embedded diagrams, and graphs will be handled through spatial + contextual linking logic. You’ll receive a clean, extensible Python system with clear documentation and sample outputs. I’m happy to iterate until accuracy meets your expectations. Let’s build this the right way.
$140 AUD in 5 days
0.0
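The "spatial linking" idea in the bid above, attaching a diagram or graph to the right question, can be illustrated with a reading-order rule: a figure belongs to the last question that starts above it, looking back to the previous page if necessary. This is a simplified sketch under assumptions. The classes `Block` and `QuestionStart` and the function `owner_of` are hypothetical, it assumes a single-column layout with y increasing downward, and it is not presented as the bidder's actual logic.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Block:
    page: int
    y0: float  # top edge; y grows downward
    y1: float  # bottom edge


@dataclass
class QuestionStart(Block):
    number: int = 0


def owner_of(figure: Block, starts: List[QuestionStart]) -> Optional[int]:
    """Return the number of the last question that starts above the figure
    in reading order (page first, then vertical position)."""
    owner = None
    for q in sorted(starts, key=lambda s: (s.page, s.y0)):
        if (q.page, q.y0) <= (figure.page, figure.y0):
            owner = q.number
        else:
            break
    return owner


# Made-up layout: question 2 starts low on page 1, question 3 mid page 2.
starts = [
    QuestionStart(1, 100, 140, number=1),
    QuestionStart(1, 500, 540, number=2),
    QuestionStart(2, 300, 340, number=3),
]
graph_on_next_page = Block(page=2, y0=50, y1=250)
print(owner_of(graph_on_next_page, starts))
```

A graph at the top of page 2 is assigned to question 2 here because no new question has started yet on that page; a figure that precedes every question start gets `None` and can be flagged for review.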

Hello, how are you? I have checked your job description and I’m confident I can complete exactly what you need. I have extensive experience with AI content, machine learning, computer vision, NLP, and building intelligent parsing systems capable of handling PDFs with complex scientific questions. Your project for an AI-powered question segregation tool is a perfect fit for my expertise. I can design a model that automatically identifies question boundaries, extracts MCQs, groups them by syllabus topics and subtopics, and handles edge cases like multi-page questions, diagrams, graphs, and mixed formats. The system can output clean screenshots or alternate downloadable formats so each question remains a unified, structured unit. I think this job is an ideal match for my skills and experience. Please send me a message so that we can discuss more. Thanks
$150 AUD in 1 day
0.0

Hello, I’ve reviewed your project, AI-Powered Question Segregation Tool, and I’m genuinely interested. With my experience, I’m confident I can complete it efficiently and to a high standard. I have a clear understanding of your main objectives. I’ve carefully reviewed the requirements to ensure nothing is overlooked. I will deliver a final result that aligns perfectly with your expectations. With my background as a Senior Software Engineer, I have strong expertise in Engineering, Machine Learning (ML). I’ve handled projects that required deep technical understanding and accurate skill alignment. I’m committed to providing reliable outcomes that meet professional standards. I have a few questions before we get started. Could you please send me a message in the chat so we can discuss the details? Thanks, Dax Manning
$140 AUD in 7 days
0.0

Throughout my career, I have had great success developing AI-powered systems that push boundaries and deliver real value. Your project requires an intricate understanding of both the technical aspects of machine learning and the complexities of capturing questions and storing them based on multiple parameters like format, topic, and even graph/image content. I will leverage my comprehensive skillset, which includes machine learning, Python, and React, to develop a robust and user-friendly tool that handles the edge cases you describe.
$140 AUD in 7 days
0.0

Hi there, do you want the first version to output cropped screenshots of each question, or would structured JSON and downloadable assets be acceptable if that gives better accuracy for multi-page and diagram-based questions? How strict does the topic mapping need to be against your syllabus, especially when one question overlaps multiple subtopics or continues across pages with shared graphs or images? I can build this as a PDF processing pipeline that detects question boundaries, keeps all related parts together, and classifies each item by format, subject, topic, and subtopic. I would focus first on a reliable sample workflow with edge cases, then turn that into a clean export system you can review and scale. I would be happy to discuss further on chat. Best, Oleksandr
$140 AUD in 7 days
0.0

I can develop a robust AI-powered system to extract, structure, and classify questions from complex PDF documents containing math, physics, and chemistry content. The solution will intelligently parse PDFs, detect individual questions (including MCQs), and accurately separate them even across multiple pages. It will also associate diagrams, graphs, and images with their respective questions using layout and spatial analysis. The system will generate structured outputs (JSON and image formats), enabling easy storage, search, and download. Additionally, I will implement a topic and subtopic classification module based on your provided syllabus, ensuring each question is properly categorized. Special attention will be given to edge cases such as multi-page questions, embedded visuals, and mixed formatting. The final product will be scalable, efficient, and designed for real-world academic and exam content processing workflows. Feel free to contact me. Best Regards, Mudassar Niaz
$100 AUD in 3 days
0.0

⭐ Hello there, My availability is immediate. I read your project post on the AI-Powered Question Segregation Tool. I am an experienced full-stack Python developer with skill sets in:
• Python, Django, Flask, FastAPI, Jupyter Notebook, Selenium, Data Visualization, ETL
• AI/ML & Data Science: Model development, training & deployment, NLP, Computer Vision, Predictive Analytics, Deep Learning
• React, JavaScript, jQuery, TypeScript, NextJS, React Native
• NodeJS, ExpressJS
• Web App Development, Web/API Scraping
• API Development, Authentication, Authorization
• SQLAlchemy, PostgresDB, MySQL, SQLite, SQLServer, Datasets
• Web hosting, Docker, Azure, AWS, GCP, Digital Ocean, GoDaddy
• Python Libraries: NumPy, pandas, scikit-learn, TensorFlow, PyTorch, etc.
Please send a message so we can quickly discuss your project and proceed further. I am looking forward to hearing from you. Thanks
$230 AUD in 3 days
0.0

I bring a practical, problem-solving mindset with experience in automation, data processing, and structured logic design. I can build a reliable system to accurately extract and organize questions from PDFs—even with complex cases like multi-page content and diagrams—while keeping the solution simple, efficient, and easy to use.
$30 AUD in 5 days
0.0

Hi, I can build a robust AI-powered pipeline to extract and structure questions from PDFs with high accuracy. I’ll combine OCR (for text + images), layout detection, and NLP to segment each question—even across multiple pages. The system will group MCQs, preserve diagrams/graphs, and attach all related elements into a single unit. For classification, I’ll implement topic & subtopic tagging using ML models aligned with your syllabus. Edge cases like multi-page questions, embedded figures, and mixed formats will be handled through bounding box tracking and document structure analysis. Output can be clean screenshots, JSON, or downloadable datasets. I’ll ensure scalability and accuracy. Let’s discuss your syllabus and sample PDFs to get started.
$140 AUD in 4 days
0.0
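The bounding-box tracking mentioned in the bid above could end in a step like this one: merge all the boxes that belong to a question into one crop rectangle per page, which is what a "screenshot" of a multi-page question would be cut from. This is only a sketch under assumptions: the `Box` tuple layout, the function names, and the padding value are hypothetical, and the actual rendering of the crop (for example with a PDF library) is left out.

```python
from typing import Dict, Iterable, Tuple

Box = Tuple[float, float, float, float]  # (x0, y0, x1, y1)


def union(a: Box, b: Box) -> Box:
    """Smallest rectangle that contains both boxes."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def crops_per_page(blocks: Iterable[Tuple[int, Box]], pad: float = 5.0) -> Dict[int, Box]:
    """Merge a question's (page, box) pairs into one padded crop per page."""
    merged: Dict[int, Box] = {}
    for page, box in blocks:
        merged[page] = union(merged[page], box) if page in merged else box
    return {
        page: (max(0.0, x0 - pad), max(0.0, y0 - pad), x1 + pad, y1 + pad)
        for page, (x0, y0, x1, y1) in merged.items()
    }


# Made-up question: stem and diagram on page 3, graph and options on page 4.
blocks = [
    (3, (50, 600, 550, 700)),
    (3, (120, 700, 400, 780)),
    (4, (50, 40, 550, 300)),
]
crops = crops_per_page(blocks)
print(crops)
```

The per-page crops can then be rendered and stacked vertically into one image, or kept as separate files listed in the question's metadata.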

Hi, this is a perfect use case for combining Computer Vision and NLP. I’ll design a model pipeline using document layout analysis + OCR + semantic parsing to extract each question as a complete unit. Multi-page questions, diagrams, and graph-based content will be preserved using spatial linking and context-aware grouping. I’ll also train/implement a classifier to map questions to your syllabus topics and subtopics. Deliverables can include:
• Question-level screenshots
• Structured datasets (JSON/CSV)
• Organized topic-wise storage
The system will be accurate, efficient, and built to handle real-world PDF inconsistencies. Happy to start with a small sample test.
$100 AUD in 6 days
0.0

Sydney, Australia
Payment method verified
Joined Mar 31, 2026
$30-250 AUD