
Suoritettu
Julkaistu
Maksettu toimituksen yhteydessä
Necesito automatizar la consulta de la siguiente página del SRI: [login to view URL] Al ingresar un RUC o cédula debo obtener y guardar en un archivo JSON estos campos exactos: • Estado de Contribuyente • Razón social • Indicador de “Contribuyente fantasma” • Actividad económica principal Requerimientos técnicos: – El script debe funcionar bajo llamada, es decir, pueda ejecutarlo manualmente cada vez que lo necesite con una lista de RUCs como entrada. – La salida debe ser un json por cada Identificación consultada. – Incluye un breve README con instrucciones de instalación y uso. Criterios de aceptación: 2. El tiempo medio por consulta no debe exceder lo razonable para evitar bloqueos (puedes aplicar esperas aleatorias o rotación de user-agent). 3. Código claro, comentado, sin dependencias innecesarias. Si ya tienes experiencia scrapeando portales gubernamentales o manejando captchas, coméntame brevemente tu enfoque y plazo estimado. Es entrega del script completo, NO COMO SERVICIO.
Projektin tunnus (ID): 40226848
62 ehdotukset
Etäprojekti
Aktiivinen 16 päivää sitten
Aseta budjettisi ja aikataulu
Saa maksu työstäsi
Kuvaile ehdotustasi
Rekisteröinti ja töihin tarjoaminen on ilmaista

Soy latino como tu, así que por zona horaria y cultura, nos podríamos llevar muy bien. Mis trabajos han sido calificados con 5 estrellas. Tengo amplia experiencia scrapeando sitios web, te doy algunos ejemplos. * Portales inmobiliarios en españa. * Sitios de partes de vehiculos para todo Chile. * Obtención de puntos de distribución de paquetes como Amazon, Fedex, DHL y otros propios de cada país ( España y Francia). * Datos de instituciones educativas. Este proyecto estimo entregarlo en un solo día la codificación y la data según el volumen de datos que tengas. * Tu suministrarás una lista de documentos y la salida será como tu lo indicas. * Tendré en cuenta estrategias antibaneo. * Código documentado sin librerías y dependencias ajenas.
$250 USD 1 päivässä
5,0
5,0
62 freelancerit tarjoavat keskimäärin $453 USD tätä projektia

I have extensive experience in PHP, Python, Data Processing, Web Scraping, and C# Programming, making me a great match for the "Scraper SRI datos contribuyente" project. I am confident in my ability to automate the data retrieval process from the SRI website efficiently. Rest assured, I can adjust the budget based on the project scope and ensure timely delivery. Please review my 15-year-old profile to see my past work. Let's discuss the details and get started on this project. I am eager to show my commitment and start working right away. Looking forward to hearing from you.
$525 USD 10 päivässä
8,7
8,7

Hi I can develop you a Python script to automate data query from SRI website, for user provided list of RUC/ID numbers, and extract required data into JSON format. I will provide you README instructions to setup and run the program on your end. I'm available to start right away. Abdul H.
$250 USD 1 päivässä
7,8
7,8

Hi there, I understand that you need to automate the querying of the SRI website to retrieve specific information based on RUC or cédula numbers. This includes fields such as the taxpayer's status, business name, ghost taxpayer indicator, and main economic activity. I will develop a script that allows you to manually execute queries with a list of RUCs, ensuring that the output is formatted as JSON for each identification consulted. I will implement random waits and user-agent rotation to prevent blocking, adhering to your acceptance criteria. My approach focuses on writing clear, well-commented code without unnecessary dependencies, ensuring reliability and ease of use. I will also provide a brief README with installation and usage instructions. I prioritize effective communication and quality in my work, and I am confident that I can deliver this project within the estimated timeframe. Looking forward to the opportunity to work together. Best regards, Burhan Ahmad from TechPlus
$350 USD 2 päivässä
8,2
8,2

⭐⭐⭐⭐⭐ Automate RUC Queries and Save Data in JSON Format ❇️ Hi My Friend, I hope you are doing well. I've reviewed your project requirements and see you are looking for a script to automate RUC queries on the SRI website. You don't need to look any further; Zohaib is here to help you! My team has successfully completed 50+ similar projects for web scraping and data extraction. I will create a script that captures the required fields and outputs them in JSON format, ensuring clarity and efficiency. ➡️ Why Me? I can easily automate your RUC queries as I have 5 years of experience in web scraping, data extraction, and Python programming. My expertise includes working with APIs, handling JSON data, and ensuring compliance with web scraping standards. Additionally, I have a strong grip on techniques to avoid blocks and manage user-agent rotation. ➡️ Let's have a quick chat to discuss your project in detail. I can provide samples of my previous work and explain how I will deliver your project efficiently. Looking forward to discussing this with you in chat. ➡️ Skills & Experience: ✅ Python Programming ✅ Web Scraping ✅ JSON Data Handling ✅ API Integration ✅ Data Extraction ✅ Script Optimization ✅ Error Handling ✅ User-Agent Rotation ✅ Clear Documentation ✅ Git Version Control ✅ Selenium Automation ✅ Timeout Management Waiting for your response! Best Regards, Zohaib
$350 USD 2 päivässä
8,1
8,1

As a seasoned technology partner at WellSpring Infotech, I bring over a decade of hands-on experience developing custom web and mobile applications, with a keen focus on data retrieval and management. My proficiency in PHP and Python, two highly adept languages for web scraping, makes me an ideal fit for your Scraper SRI datos contribuyente project. My approach to scraping government portals involves implementing random waits and user-agent rotation to mitigate any potential issues such as IP blocking. Additionally, I am well-versed in maintaining code clarity while avoiding unnecessary dependencies - a key requirement you've specified. Being deadline-driven, my estimated timeframe for delivering the completed script aligns strongly with your need for speed and accuracy. Thanks...
$750 USD 7 päivässä
7,8
7,8

With over 13 years of experience, I bring a wealth of expertise in Python web automation and scraping that would be extremely valuable for your project. Having developed tailored solutions for similar data extraction projects, I understand the criticality of timely and accurately retrieving information from governmental portals while bypassing potential blockages or captchas. Adhering to your core requirements, I'll create a script that can be manually executed with your list of RUCs as input. I believe what differentiates me is not just my ability to efficiently get the work done, but the quality and clarity of my deliverables. Your JSON requirement will be met without adding any unnecessary dependencies. Moreover, I am familiar with employing techniques like random waits and user-agent rotation to maintain reasonable query time. Since you require the script rather than a service, you can expect clean, commented code emphasizing efficiency and robustness delivered within the agreed timeframe. My experienc
$500 USD 2 päivässä
7,1
7,1

Your SRI scraper will fail if you don't handle their CAPTCHA and session token rotation correctly. I've seen three freelancers deliver broken scripts for this exact portal because they ignored the anti-bot middleware that kicks in after 5 requests. Before I architect the solution, I need clarity on two things: 1. Does your RUC list contain 50 identifications or 5,000? The approach changes completely - small batches can run sequentially with random delays, but high volume needs proxy rotation and distributed requests. 2. Are you running this from Ecuador or internationally? SRI blocks non-EC IPs aggressively, which means you'll need residential proxies or a VPS in Quito to avoid instant 403 errors. Here's the technical approach: - PYTHON + SELENIUM: Headless Chrome with undetected-chromedriver to bypass Cloudflare fingerprinting and handle dynamic JavaScript rendering that blocks basic requests library calls. - CAPTCHA HANDLING: Integrate 2Captcha API for automated solving when SRI triggers challenges (happens randomly every 10-15 requests). Falls back to manual intervention if budget is tight. - SESSION MANAGEMENT: Rotate user-agents and maintain cookie persistence across requests to mimic human behavior. Implements exponential backoff when rate limits hit. - JSON OUTPUT: Structured schema with error logging - if a RUC returns "No encontrado" or times out, it logs the failure reason instead of breaking the entire batch. - RETRY LOGIC: Automatic retry with 3-5 second delays for network failures or temporary blocks, then marks the RUC as "requires manual review" after 3 attempts. I've scraped 4 Latin American government portals including SUNAT (Peru) and AFIP (Argentina) where anti-bot measures are similar. The key difference with SRI is their session tokens expire every 8 minutes, so the script needs to refresh the base URL periodically. I don't take projects where the client expects 100% success rate on government scrapers - there will always be 2-5% that need manual verification due to portal downtime or CAPTCHA failures. Let's discuss your volume and timeline so I can give you a realistic delivery estimate instead of promising something that breaks in production.
$450 USD 10 päivässä
6,3
6,3

Hola, He revisado los detalles de su proyecto y definitivamente puedo ayudarle con la automatización de la consulta en la página del SRI. Tengo más de 10 años de experiencia en desarrollo de scripts para scraping web, utilizando herramientas efectivas en Python, PHP y C#. Mi enfoque es crear un código claro y eficiente que cumpla con sus requisitos y evite bloqueos mediante técnicas de espera aleatoria y alternancia de user-agents. Primero, crearé un script que reciba una lista de RUCs y devuelva un archivo JSON con los campos que ha mencionado. Además, incluiré un README con instrucciones detalladas para la instalación y uso del script. Aquí está mi portafolio: https://www.freelancer.in/u/ixorawebmob Estoy muy interesado en su proyecto y me encantaría discutir más detalles. ¿Me podría indicar si tiene alguna preferencia en cuanto a la tecnología utilizada para este script, o si le gustaría revisar algo más específico? ¿Tiene alguna preferencia en cuanto a la tecnología utilizada para este script, o le gustaría revisar algo más específico? ¡Espero su respuesta! Saludos, Arpit Jaiswal
$250 USD 25 päivässä
7,0
7,0

Hello Dear! I write to introduce myself. I'm Engineer Toriqul Islam. I was born and grew up in Bangladesh. I speak and write in English like native people. I am a B.S.C. Engineer of Computer Science & Engineering. I completed my graduation from Rajshahi University of Engineering & Technology ( RUET). I love to work on Web Design & Development project. Web Design & development: I am a full-stack web developer with more than 10 years of experience. My design Approach is Always Modern and simple, which attracts people towards it. I have built websites for a wide variety of industries. I have worked with a lot of companies and built astonishing websites. All Clients have good reviews about me. Client Satisfaction is my first Priority. Technologies We Use: Custom Websites Development Using ======>Full Stack Development. 1. HTML5 2. CSS3 3. Bootstrap4 4. jQuery 5. JavaScript 6. Angular JS 7. React JS 8. Node JS 9. WordPress 10. PHP 11. Ruby on Rails 12. MYSQL 13. Laravel 14. .Net 15. CodeIgniter 16. React Native 17. SQL / MySQL 18. Mobile app development 19. Python 20. MongoDB What you'll get? • Fully Responsive Website on All Devices • Reusable Components • Quick response • Clean, tested and documented code • Completely met deadlines and requirements • Clear communication You are cordially welcome to discuss your project. Thank You! Best Regards, Toriqul Islam
$250 USD 5 päivässä
5,9
5,9

Hola, estoy seguro de poder ayudarte con la automatización de la consulta en la página del SRI. Tengo experiencia en web scraping y automatización de procesos similares. Mi enfoque incluirá el desarrollo de un script en Python que pueda ser ejecutado manualmente para obtener y guardar la información requerida en archivos CSV. Utilizaré técnicas como esperas aleatorias para evitar bloqueos y garantizar un tiempo de respuesta razonable por consulta. Mi plazo estimado para la entrega del script es de [Suggest minimum number of days...]. ¿Cómo puedo asistirte con este proyecto?
$500 USD 7 päivässä
5,5
5,5

I can do it. As 9+ years experiences in these field. I can give good quality work. I have read the guidelines of your work.I believe that i can provide you the best quality works you are anticipating from this platform give me a chance to show you the best i can do at your service.
$500 USD 7 päivässä
6,0
6,0

https://www.freelancer.com/projects/data-scraping/Automated-Counterfeit-Detection/reviews Dear. Nice to meet you. I am very pleasure to submit my proposal on your scrapping and automation project. I have many experiences in these field using python. Recently, I developed Automated Counterfeit Detection and Reporting System on Amazon. You can check this in my portfolio. I am sure and I can start immediately. I will wait for your good news. Thank you.
$250 USD 3 päivässä
5,6
5,6

¡Hola! Tenemos amplia experiencia extrayendo datos de portales gubernamentales, incluyendo sitios con validaciones CAPTCHA y medidas anti-bot. Somos un equipo de 62 profesionales con más de 9 años de experiencia en web scraping, automatización y Python. Entregamos el script completo, no como servicio. Here's how we can help: - Script en Python que consulta el portal del SRI, extrae exactamente los cuatro campos requeridos y guarda un JSON por cada identificación - Manejo de esperas aleatorias, rotación de user-agents y reintentos inteligentes para evitar bloqueos — tiempos de consulta razonables - Código claro, comentado en español/inglés, con dependencias mínimas (requests, beautifulsoup4, lxml) - README con instrucciones paso a paso de instalación y uso - Entregable listo para ejecutar manualmente con lista de RUCs como entrada He scrapeado portales similares de Ecuador, Chile y México. Plazo estimado: 4-5 días, incluyendo pruebas con diferentes RUCs. ¿El sitio actualmente presenta CAPTCHA frecuentemente? ¿Tienes preferencia por Python puro o podemos usar selenium si es necesario?
$500 USD 7 päivässä
5,4
5,4

Hello, I can build a Python script to query the SRI RUC page and extract the required fields, saving one JSON per ID. It will accept batch input, use random delays and rotating user-agents to avoid blocks, and be clean, well-commented, with minimal dependencies plus a README. I have experience scraping government portals and handling anti-bot measures. Estimated delivery: 2–3 days.
$300 USD 2 päivässä
5,0
5,0

As a multi-disciplinary data scientist and experienced web scraper, I am well-equipped to handle your project "Scraper SRI datos contribuyente". My mastery of Python, one of the most powerful languages for web scraping, ensures that I have the skills needed to extract and save your required JSON fields accurately. My firm understanding of JSON structures will ensure neat output and convenient utilization. I understand the unique challenges posed by governmental websites, such as captcha and restrictions on request frequency. My experience dealing with those obstacles suggests implementation of randomized waiting times and user-agent rotation to maintain standard request patterns. This approach prevents blocking due to excess queries. As a result, my estimable timeline for your project guarantees fast turn arounds that keep you ahead. My comprehensive portfolio demonstrates an exhaustive list of technical skills that aligns perfectly with your project requirements. From scraping through data management to crafting APIs for easy usage, I guarantee efficient deployments. Beyond just executing the requested task, I provide added value by documenting all steps within a README file, facilitating future use and maintenance. Trust me with your project and experience 'efficiency' & 'expertise' in their true sense.
$250 USD 7 päivässä
5,2
5,2

As a seasoned web and mobile app developer with over 9 years in the field, I have extensive experience in PHP, which is a fundamental skill for web scraping. I have worked on various projects that required extracting data from different websites, including government portals. With regards to dealing with captchas, I am well-versed in implementing automated solutions such as waiting times and rotating user-agents to prevent blocking and enhance efficiency. Moreover, I understand how vital it is to deliver clean, commented code without unnecessary dependencies. You can be assured that the script I will provide you will not only be efficient but also easy to understand and use. Alongside the script, I will also include a detailed README file to guide you through the installation and operation process. Most importantly, I value quality work and timely delivery. I will adhere strictly to your project's timeframe while ensuring that the average query time remains reasonable to avoid suspicious activity detection. My ultimate goal is turning your ideas into reality with little disruption for your day-to-day operations. Partner with me for this project and redefine your expectations!
$500 USD 7 päivässä
5,4
5,4

Soy Leo Sarmiento, desarrollador Full‑Stack con 10 años de experiencia en automatización. Propongo desarrollar un script en Python con Playwright que consulte una lista de RUCs, extraiga Estado, Razón social, fantasma y actividad económica, y genere JSON por cada identificación; el script incluirá esperas aleatorias, rotación de user‑agent y manejo de captcha opcional, junto con un README detallado para su ejecución local. ¿Podemos coordinar una breve llamada para precisar el volumen de consultas y confirmar el enfoque?
$300 USD 5 päivässä
4,9
4,9

Soy especialista en extracción de datos. Si revisas mi perfil, verás reseñas de scraping ya realizados con éxito, lo que garantiza un enfoque probado en entornos con bloqueos complejos. Enfoque Técnico Tecnología: Usaré Python con Playwright. Es más rápido que Selenium y maneja mejor el renderizado dinámico del SRI para asegurar que el "Indicador Fantasma" se capture tras cargar la página. Evasión de Bloqueos: Implementaré rotación de User-Agents y esperas aleatorias (comportamiento humano) para minimizar el riesgo de baneo de IP. Manejo de Captcha: Dejaré el script listo para resolución manual asistida o integración con APIs (2Captcha/Anticaptcha) si requieres automatización total. Entregables Script Python: Modular, recibe lista de RUCs y genera un JSON individual por consulta. Campos: Estado, Razón Social, Contribuyente Fantasma y Actividad Principal. README: Guía de instalación y uso en 3 pasos. Plazo: 3-4 días (incluye pruebas de estrés).
$350 USD 7 päivässä
4,9
4,9

Como desarrollador web experimentado especializado en automatización y extracción de datos, puedo asegurarle que soy la persona ideal para su proyecto de scraping. Domino lenguajes como Python y C#. Estoy familiarizado con Playwright y Selenium. En cuanto a sus preocupaciones de tiempo, creo firmemente en optimizar el rendimiento de los scripts. Puedo implementar esperas aleatorias y estrategias de rotación de agentes de usuario para mantener la duración de las consultas dentro de un rango razonable y minimizar el riesgo de bloqueo. Con este enfoque, le aseguro que el tiempo de respuesta para cada consulta se mantendrá preciso. Mi experiencia se extiende no solo a la extracción de datos, sino también al manejo de escenarios de automatización complejos, como la gestión de portales gubernamentales y CAPTCHAs. Por lo tanto, comprendo con precisión los matices de los desafíos del proyecto y puedo ofrecer soluciones inteligentes para no exceder ninguna restricción y recuperar toda la información requerida correctamente. Permítame ofrecerle un camino claro hacia la automatización; ¡no se arrepentirá!
$350 USD 7 päivässä
4,3
4,3

Drawing from my strong background in web development, particularly in C# programming and web scraping, I am an ideal choice for your project. Not only do I have a deep understanding of the technical aspects required for this job, but I am also experienced in working with government portals and handling captchas. Given the importance of avoiding blocks and ensuring reasonable waiting times per query, I would implement random wait times and user-agent rotation strategies to produce optimal scraping results. My familiarity with PHP, Python and Node.js means that I can complete this task effectively regardless of your preference. In terms of deadline, my efficient coding practices assure you that I can deliver a fully functional script within the agreed-upon timeframe. Lastly, I want to emphasize that as a full-stack developer, I don't just focus on getting the job done – I also prioritize code clarity and usability. Rest assured, the script will be well-commented with no unnecessary dependencies.
$500 USD 7 päivässä
3,8
3,8

Quito, Ecuador
Maksutapa vahvistettu
Liittynyt lokak. 22, 2019
$10-30 USD
$30-250 USD
$30-250 USD
$30-250 USD
$10-30 USD
₹750-1250 INR/ tunnissa
₹100-400 INR/ tunnissa
₹1500-12500 INR
$250-750 AUD
₹750-1250 INR/ tunnissa
$750-1500 USD
$30-250 USD
$8-15 USD/ tunnissa
₹12500-37500 INR
₹750-1250 INR/ tunnissa
₹1500-12500 INR
£20-250 GBP
£250-750 GBP
$750-1500 USD
₹1500-12500 INR
₹600-1200 INR
£20-250 GBP
₹600-1500 INR
$500-1000 USD
₹600-1500 INR