Commit e1d2c3d

author
dat-a-man
committed
update
1 parent c812287 commit e1d2c3d

1 file changed: +15 -12 lines changed

docs/website/docs/general-usage/data-enrichments/url-parser-data-enrichment.md

Lines changed: 15 additions & 12 deletions
@@ -10,9 +10,9 @@ URL parser data enrichment is extracting various URL components to gain addition
 context about the URL. This extracted information can be used for data analysis, marketing, SEO, and
 more.
 
-## Setup Guide
+## URL parsing process
 
-To enable this is URL parser data enrichment. A few important steps are involved:
+Here is step-by-step process for URL parser data enrichment :
 
 1. Get the URL data that is needed to be parsed from a source or create one.
 1. Send the URL data to an API like [URL Parser API](https://urlparse.com/).
@@ -23,7 +23,8 @@ To enable this is URL parser data enrichment. A few important steps are involved
 We use **[URL Parse API](https://urlparse.com/)** to extract the information about the URL. However,
 you can use any API you prefer.
 
-:::tip `URL Parse API` is free, with a 1000 requests/hour limit, which can be increased on request.
+:::tip
+`URL Parse API` is free, with 1000 requests/hour limit, which can be increased on request.
 :::
 
 By default the URL Parse API will return a JSON response like:
@@ -121,10 +122,11 @@ Here's the resource that yields the sample data as discussed above:
 
 # Sample data representing tracked user data
 sample_data = [
-{"user_id": 1, "device_name": "Sony Experia XZ", "page_referer":
-"https://b2venture.lightning.force.com/"},
-{"user_id": 2, "device_name": "Samsung Galaxy S23 Ultra 5G",
-"page_referer": "https://techcrunch.com/2023/07/20/can-dlthub-solve-the-python-library-problem-for-ai-dig-ventures-thinks-so/"},
+{
+"user_id": 1,
+"device_name": "Sony Experia XZ",
+"page_referer": "https://b2venture.lightning.force.com/"
+},
 """
 Data for other users
 """
@@ -182,9 +184,8 @@ need to register to use this service neither get an API key.
 The `dlt` library's `transformer` and `add_map` functions serve distinct purposes in data
 processing.
 
-`Transformers` used to process a resource and are ideal for post-load data transformations in a
-pipeline, compatible with tools like `dbt`, the `dlt SQL client`, or Pandas for intricate data
-manipulation. To read more:
+`Transformers` are a form of `dlt resource` that takes input from other resources
+via `data_from` argument to enrich or transform the data.
 [Click here.](../../general-usage/resource.md#process-resources-with-dlttransformer)
 
 Conversely, `add_map` used to customize a resource applies transformations at an item level
@@ -208,7 +209,8 @@ need to register to use this service neither get an API key.
 print(load_info)
 ```
 
-:::info Please note that the same outcome can be achieved by using the transformer function. To
+:::info
+Please note that the same outcome can be achieved by using the transformer function. To
 do so, you need to add the transformer decorator at the top of the `url_parser` function. For
 `pipeline.run`, you can use the following code:
 
@@ -220,7 +222,8 @@ need to register to use this service neither get an API key.
 )
 ```
 
-This will execute the `url_parser` function with the tracked data and return parsed URL. :::
+This will execute the `url_parser` function with the tracked data and return parsed URL.
+:::
 
 ### Run the pipeline
 
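Note on the enrichment step this diff documents: the idea of attaching parsed URL components to each tracked record can be sketched locally with Python's standard library. This is only an illustrative stand-in, not the URL Parse API and not the `url_parser` function from the docs; the helper name `enrich_with_url_parts` and the `parsed_url` field layout are assumptions made for this sketch, and the real API response may be shaped differently.

```python
from urllib.parse import urlparse

# One record from the sample data shown in the diff above.
sample_data = [
    {
        "user_id": 1,
        "device_name": "Sony Experia XZ",
        "page_referer": "https://b2venture.lightning.force.com/",
    },
]


def enrich_with_url_parts(record):
    """Return a copy of the record with parsed URL components attached.

    Hypothetical helper: the docs call an external API here instead;
    urllib.parse is used as a local substitute for illustration only.
    """
    parts = urlparse(record["page_referer"])
    enriched = dict(record)  # shallow copy so the input record is not mutated
    enriched["parsed_url"] = {
        "scheme": parts.scheme,
        "host": parts.hostname,
        "path": parts.path,
    }
    return enriched


enriched_rows = [enrich_with_url_parts(row) for row in sample_data]
```

In the documented pipeline the same per-item shape applies: each record goes in, and the record plus URL details comes out, whether the function is attached with `add_map` or used as a transformer.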