Improve API docs of public components #304
Comments
### Description

- Improve docstrings of storage classes.
- I also changed the list of main classes to reflect at least "somehow" the current public interface.

### Issues

- Relates: #304

### Testing

- Website was rendered locally.

### Checklist

- [x] CI passed
Hi @vdusek, I’d like to work on this issue to improve the API docs for the listed public components. Before starting, I have a few quick questions:
Looking forward to your guidance!
Hi @belloibrahv, thanks for your interest in Crawlee.
Hi @vdusek, I'd like to contribute to improving the API docs for the public components as described in this issue. After reviewing the provided references, guidelines, and the list of components, here's how I plan to approach this task:
I plan to start with the

Is this approach aligned with your expectations? Do you have any additional guidance or specific areas you'd like me to focus on within these unchecked components?

Thank you for the opportunity to contribute to Crawlee!

Best regards,
@belloibrahv That would be great, thanks.
@vdusek, thank you for your feedback. I’ve reviewed the JS API class you mentioned (https://github.com/apify/crawlee/blob/master/packages/basic-crawler/src/internals/basic-crawler.ts) and understand your concerns.

For issue #304, would it still be appropriate to proceed if I ensure strict adherence to the guidelines and requirements this time? Additionally, since you mentioned that

Thank you for your help and patience.
Hi @belloibrahv, if you're still interested in working on this, I can provide further guidelines on what we're expecting.

I recommend focusing on one of the more "high-level" HTTP-based crawlers -

For inspiration, take a look at some of the already completed classes in the checklist above, such as

Please make only relevant changes. Stay within the scope of this issue/PR, and ensure that the pull request has a clear, single objective. For this PR, that means updating the class docstring. You can also modify the method docstrings if you're confident, but avoid making changes outside the intended scope.

You should now be familiar with the Google-style docstrings, as we discussed in your previous PR. If you choose one of the HTTP-based crawlers, you should describe how they inherit from

It should not be just about using ChatGPT or any other LLM. Of course, you can use them, but first you need to understand the code and the issue you're trying to solve, so that they can provide you with meaningful output. Choose one of the crawlers I suggested, try to understand the code, and write something meaningful.

Thank you, and good luck!
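For readers unfamiliar with the convention mentioned above, a Google-style class docstring might look roughly like the following sketch. The class names, parameters, and the `describe` method are hypothetical and chosen only for illustration; they are not Crawlee's actual classes or API. The sketch shows a summary line, a paragraph explaining what the subclass inherits, and the `Attributes:`, `Args:`, and `Returns:` sections.

```python
import inspect


class ExampleBaseCrawler:
    """Hypothetical base class that holds settings shared by all crawlers."""

    def __init__(self, max_requests: int = 10) -> None:
        self.max_requests = max_requests


class ExampleHttpCrawler(ExampleBaseCrawler):
    """Hypothetical crawler that processes pages fetched over plain HTTP.

    It inherits request limiting from `ExampleBaseCrawler` and adds a
    user agent setting. It does not run a browser, so it suits pages that
    do not need JavaScript rendering.

    Attributes:
        max_requests: Upper bound on the number of requests to process.
        user_agent: Value sent in the `User-Agent` header.
    """

    def __init__(self, max_requests: int = 10, user_agent: str = "example-bot") -> None:
        """Initialize the crawler.

        Args:
            max_requests: Upper bound on the number of requests to process.
            user_agent: Value sent in the `User-Agent` header.
        """
        super().__init__(max_requests=max_requests)
        self.user_agent = user_agent

    def describe(self) -> str:
        """Return a short description of the crawler settings.

        Returns:
            A one-line summary containing the request limit and user agent.
        """
        return f"max_requests={self.max_requests}, user_agent={self.user_agent}"


# inspect.getdoc returns the docstring with its indentation cleaned up.
summary_line = inspect.getdoc(ExampleHttpCrawler).splitlines()[0]
print(summary_line)
```

The class docstring describes what the class is and how it relates to its parent, while the constructor and method docstrings document their own arguments and return values.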
Hi, could you please review my new PR #613 when you have a moment? Thank you!
### Description

- Improve API docs of all crawlers.

### Issues

- Relates #304

### Testing

- The doc website was rendered locally.

### Checklist

- [x] CI passed

---------

Co-authored-by: Jan Buchar <[email protected]>
### Description

- Update API docs of public components.
- Add functions (context/crawler helpers) to a "Functions" API docs section.

### Issues

- Closes: #304

### Testing

- The docs were rendered locally.

### Checklist

- [x] CI passed
Improve API docs of the public components, mainly:
BasicCrawler
HttpCrawler
BeautifulSoupCrawler
ParselCrawler
PlaywrightCrawler
Dataset
KeyValueStore
RequestQueue
MemoryStorageClient
HttpxHttpClient
CurlImpersonateHttpClient
Configuration
EventManager
LocalEventManager
Request
Session
SessionPool
BrowserPool
PlaywrightBrowserController
PlaywrightBrowserPlugin
Statistics
And functions:
AddRequestsFunction
EnqueueLinksFunction
ExportToFunction
GetDataFunction
GetKeyValueStoreFunction
PushDataFunction
SendRequestFunction
UseStateFunction