@@ -10,9 +10,9 @@ URL parser data enrichment is extracting various URL components to gain addition
1010context about the URL. This extracted information can be used for data analysis, marketing, SEO, and
1111more.
1212
13- ## Setup Guide
13+ ## URL parsing process
1414
15- To enable this is URL parser data enrichment. A few important steps are involved :
15+ Here is step-by-step process for URL parser data enrichment :
1616
17171 . Get the URL data that is needed to be parsed from a source or create one.
18181 . Send the URL data to an API like [ URL Parser API] ( https://urlparse.com/ ) .
@@ -23,7 +23,8 @@ To enable this is URL parser data enrichment. A few important steps are involved
2323We use ** [ URL Parse API] ( https://urlparse.com/ ) ** to extract the information about the URL. However,
2424you can use any API you prefer.
2525
26- :::tip ` URL Parse API ` is free, with a 1000 requests/hour limit, which can be increased on request.
26+ ::: tip
27+ ` URL Parse API ` is free, with 1000 requests/hour limit, which can be increased on request.
2728:::
2829
2930By default the URL Parse API will return a JSON response like:
@@ -121,10 +122,11 @@ Here's the resource that yields the sample data as discussed above:
121122
122123 # Sample data representing tracked user data
123124 sample_data = [
124- {" user_id" : 1 , " device_name" : " Sony Experia XZ" , " page_referer" :
125- " https://b2venture.lightning.force.com/" },
126- {" user_id" : 2 , " device_name" : " Samsung Galaxy S23 Ultra 5G" ,
127- " page_referer" : " https://techcrunch.com/2023/07/20/can-dlthub-solve-the-python-library-problem-for-ai-dig-ventures-thinks-so/" },
125+ {
126+ " user_id" : 1 ,
127+ " device_name" : " Sony Experia XZ" ,
128+ " page_referer" : " https://b2venture.lightning.force.com/"
129+ },
128130 """
129131 Data for other users
130132 """
@@ -182,9 +184,8 @@ need to register to use this service neither get an API key.
182184 The ` dlt ` library's ` transformer ` and ` add_map ` functions serve distinct purposes in data
183185 processing.
184186
185- ` Transformers ` used to process a resource and are ideal for post-load data transformations in a
186- pipeline, compatible with tools like ` dbt ` , the ` dlt SQL client ` , or Pandas for intricate data
187- manipulation. To read more:
187+ ` Transformers ` are a form of ` dlt resource ` that takes input from other resources
188+ via ` data_from ` argument to enrich or transform the data.
188189 [ Click here.] ( ../../general-usage/resource.md#process-resources-with-dlttransformer )
189190
190191 Conversely, ` add_map ` used to customize a resource applies transformations at an item level
@@ -208,7 +209,8 @@ need to register to use this service neither get an API key.
208209 print (load_info)
209210 ```
210211
211- :::info Please note that the same outcome can be achieved by using the transformer function. To
212+ ::: info
213+ Please note that the same outcome can be achieved by using the transformer function. To
212214 do so, you need to add the transformer decorator at the top of the ` url_parser ` function. For
213215 ` pipeline.run ` , you can use the following code:
214216
@@ -220,7 +222,8 @@ need to register to use this service neither get an API key.
220222 )
221223 ```
222224
223- This will execute the ` url_parser ` function with the tracked data and return parsed URL. :::
225+ This will execute the ` url_parser ` function with the tracked data and return parsed URL.
226+ :::
224227
225228### Run the pipeline
226229
0 commit comments