11# ๐ท๏ธ ScrapeGraphAI: ํ ๋ฐฉ์ ๋๋ด๋ ์น์คํฌ๋ํ
22
33
4- ScrapeGraphAI๋ ์น ์ฌ์ดํธ์ ๋ก์ปฌ ๋ฌธ์(XML, HTML, JSON ๋ฑ)์ ๋ํ ์คํฌ๋ํ ํ์ดํ๋ผ์ธ์ ๋ง๋ค๊ธฐ ์ํด LLM ๋ฐ ์ง์ ๊ทธ๋ํ ๋ก์ง์ ์ฌ์ฉํ๋ ํ์ด์ฌ ์น์คํฌ๋ํ ๋ผ์ด๋ธ๋ฌ๋ฆฌ์
๋๋ค.
4+ [ ScrapeGraphAI ] ( https://scrapegraphai.com ) ๋ ์น ์ฌ์ดํธ์ ๋ก์ปฌ ๋ฌธ์(XML, HTML, JSON ๋ฑ)์ ๋ํ ์คํฌ๋ํ ํ์ดํ๋ผ์ธ์ ๋ง๋ค๊ธฐ ์ํด LLM ๋ฐ ์ง์ ๊ทธ๋ํ ๋ก์ง์ ์ฌ์ฉํ๋ ํ์ด์ฌ ์น์คํฌ๋ํ ๋ผ์ด๋ธ๋ฌ๋ฆฌ์
๋๋ค.
55
66์ถ์ถํ๋ ค๋ ์ ๋ณด๋ฅผ ๋งํ๊ธฐ๋ง ํ๋ฉด ๋ผ์ด๋ธ๋ฌ๋ฆฌ๊ฐ ์์์ ์ฒ๋ฆฌํด ์ค๋๋ค!
77
88<p align =" center " >
99 <img src =" https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/sgai-hero.png " alt =" ScrapeGraphAI Hero " style =" width : 100% ;" >
1010</p >
1111
12+ ## ๐ ํตํฉ
13+ ScrapeGraphAI๋ ์ธ๊ธฐ ์๋ ํ๋ ์์ํฌ ๋ฐ ๋๊ตฌ์์ ์ํํ ํตํฉ์ ์ ๊ณตํ์ฌ ์คํฌ๋ํ ๋ฅ๋ ฅ์ ํฅ์์ํต๋๋ค. ํ์ด์ฌ์ด๋ Node.js๋ก ๊ฐ๋ฐํ๋ , LLM ํ๋ ์์ํฌ๋ฅผ ์ฌ์ฉํ๋ , ๋
ธ์ฝ๋ ํ๋ซํผ์ด๋ ์ ํฌ์ ํฌ๊ด์ ์ธ ํตํฉ ์ต์
์ ์ ๊ณตํฉ๋๋ค.
14+
15+ ๋ ๋ง์ ์ ๋ณด๋ ๋ค์ [ ๋งํฌ] ( https://scrapegraphai.com ) ์์ ํ์ธํ ์ ์์ต๋๋ค
16+
17+ ** Integrations** :
18+ - ** API** : [ Documentation] ( https://docs.scrapegraphai.com/introduction )
19+ - ** SDKs** : [ Python] ( https://docs.scrapegraphai.com/sdks/python ) , [ Node] ( https://docs.scrapegraphai.com/sdks/javascript )
20+ - ** LLM Frameworks** : [ Langchain] ( https://docs.scrapegraphai.com/integrations/langchain ) , [ Llama Index] ( https://docs.scrapegraphai.com/integrations/llamaindex ) , [ Crew.ai] ( https://docs.scrapegraphai.com/integrations/crewai ) , [ Agno] ( https://docs.scrapegraphai.com/integrations/agno ) , [ CamelAI] ( https://github.com/camel-ai/camel )
21+ - ** Low-code Frameworks** : [ Pipedream] ( https://pipedream.com/apps/scrapegraphai ) , [ Bubble] ( https://bubble.io/plugin/scrapegraphai-1745408893195x213542371433906180 ) , [ Zapier] ( https://zapier.com/apps/scrapegraphai/integrations ) , [ n8n] ( http://localhost:5001/dashboard ) , [ Dify] ( https://dify.ai ) , [ Toolhouse] ( https://app.toolhouse.ai/mcp-servers/scrapegraph_smartscraper )
22+ - ** MCP server** : [ Link] ( https://smithery.ai/server/@ScrapeGraphAI/scrapegraph-mcp )
23+
1224## ๐ ๋น ๋ฅธ ์ค์น
1325
1426Scrapegraph-ai์ ๋ํ ์ฐธ์กฐ ํ์ด์ง๋ PyPI์ ๊ณต์ ํ์ด์ง์์ ํ์ธํ ์ ์์ต๋๋ค: [ pypi] ( https://pypi.org/project/scrapegraphai/ ) .
1527
1628``` bash
1729pip install scrapegraphai
30+
31+ # ์ค์ (์น์ฌ์ดํธ ์คํฌ๋ํ์ ์ํ ์ค์น)
32+ playwright install
1833```
1934์ฐธ๊ณ : ๋ค๋ฅธ ๋ผ์ด๋ธ๋ฌ๋ฆฌ์์ ์ถฉ๋์ ํผํ๊ธฐ ์ํด ๋ผ์ด๋ธ๋ฌ๋ฆฌ๋ฅผ ๊ฐ์ ํ๊ฒฝ์ ์ค์นํ๋ ๊ฒ์ด ์ข์ต๋๋ค ๐ฑ
2035
21- ## ๐ ๋ฐ๋ชจ
22-
23- ๊ณต์ Streamlit ๋ฐ๋ชจ:
24-
25- [ ![ My Skills] ( https://skillicons.dev/icons?i=react )] ( https://scrapegraph-ai-web-dashboard.streamlit.app )
26-
27-
28- Google Colab์ ์ฌ์ฉํ์ฌ ์น์์ ์ง์ ์ฌ์ฉํด ๋ณด์ธ์:
29-
30- [ ![ Open In Colab] ( https://colab.research.google.com/assets/colab-badge.svg )] ( https://colab.research.google.com/drive/1sEZBonBMGP44CtO6GQTwAlL0BGJXjtfd?usp=sharing )
31-
32-
33- ## ๐ ๋ฌธ์
34-
35- ScrapeGraphAI์ ๋ํ ๋ฌธ์๋ [ ์ฌ๊ธฐ] ( https://scrapegraph-ai.readthedocs.io/en/latest/ ) ์์ ์ฐพ์ ์ ์์ต๋๋ค.
36-
37- ๋ํ Docusaurus๋ฅผ [ ์ฌ๊ธฐ] ( https://scrapegraph-doc.onrender.com/ ) ์์ ํ์ธํด ๋ณด์ธ์.
38-
3936## ๐ป ์ฌ์ฉ๋ฒ
37+ ์น์ฌ์ดํธ(๋๋ ๋ก์ปฌ ํ์ผ)์์ ์ ๋ณด๋ฅผ ์ถ์ถํ๊ธฐ ์ํด ์ฌ์ฉํ ์ ์๋ ์ฌ๋ฌ ํ์ค ์คํฌ๋ํ ํ์ดํ๋ผ์ธ์ด ์์ต๋๋ค.
4038
41- ์น์ฌ์ดํธ(๋๋ ๋ก์ปฌ ํ์ผ)์์ ์ ๋ณด๋ฅผ ์ถ์ถํ๊ธฐ ์ํด ์ฌ์ฉํ ์ ์๋ ์ฌ๋ฌ ํ์ค ์คํฌ๋ํ ํ์ดํ๋ผ์ธ์ด ์์ต๋๋ค:
42- - ` SmartScraperGraph ` : ์ฌ์ฉ์ ํ๋กฌํํธ์ ์
๋ ฅ ์์ค๋ง ํ์๋ก ํ๋ ๋จ์ผ ํ์ด์ง ์คํฌ๋ํผ์
๋๋ค.
43- - ` SearchGraph ` : ๊ฒ์ ์์ง์ ์์ n๊ฐ ๊ฒ์ ๊ฒฐ๊ณผ์์ ์ ๋ณด๋ฅผ ์ถ์ถํ๋ ๋ค์ค ํ์ด์ง ์คํฌ๋ํผ์
๋๋ค.
44- - ` SpeechGraph ` : ์น์ฌ์ดํธ์์ ์ ๋ณด๋ฅผ ์ถ์ถํ๊ณ ์ค๋์ค ํ์ผ์ ์์ฑํ๋ ๋จ์ผ ํ์ด์ง ์คํฌ๋ํผ์
๋๋ค.
45- - ` ScriptCreatorGraph ` : ์น์ฌ์ดํธ์์ ์ ๋ณด๋ฅผ ์ถ์ถํ๊ณ Python ์คํฌ๋ฆฝํธ๋ฅผ ์์ฑํ๋ ๋จ์ผ ํ์ด์ง ์คํฌ๋ํผ์
๋๋ค.
46-
47- - ` SmartScraperMultiGraph ` : ๋จ์ผ ํ๋กฌํํธ์ ์์ค ๋ชฉ๋ก์ ์ฌ์ฉํ์ฌ ์ฌ๋ฌ ํ์ด์ง์์ ์ ๋ณด๋ฅผ ์ถ์ถํ๋ ๋ค์ค ํ์ด์ง ์คํฌ๋ํผ์
๋๋ค.
48- - ` ScriptCreatorMultiGraph ` : ๋จ์ผ ํ๋กฌํํธ์ ์์ค ๋ชฉ๋ก์ ์ฌ์ฉํ์ฌ ์ฌ๋ฌ ํ์ด์ง์์ ์ ๋ณด๋ฅผ ์ถ์ถํ๋ Python ์คํฌ๋ฆฝํธ๋ฅผ ์์ฑํ๋ ๋ค์ค ํ์ด์ง ์คํฌ๋ํผ์
๋๋ค.
39+ ๊ฐ์ฅ ์ผ๋ฐ์ ์ธ ๊ฒ์ ` SmartScraperGraph ` ๋ก, ์ฌ์ฉ์ ํ๋กฌํํธ์ ์์ค URL์ด ์ฃผ์ด์ง ๋จ์ผ ํ์ด์ง์์ ์ ๋ณด๋ฅผ ์ถ์ถํฉ๋๋ค.
4940
50- ** OpenAI** , ** Groq** , ** Azure** , ** Gemini** ์ ๊ฐ์ API๋ฅผ ํตํด ๋ค์ํ LLM์ ์ฌ์ฉํ ์ ์์ผ๋ฉฐ, ** Ollama** ๋ฅผ ์ฌ์ฉํ์ฌ ๋ก์ปฌ ๋ชจ๋ธ๋ ์ฌ์ฉํ ์ ์์ต๋๋ค.
51-
52- ### ์ฌ๋ก 1: ๋ก์ปฌ ๋ชจ๋ธ์ ์ฌ์ฉํ๋ SmartScraper
53- [ Ollama] ( https://ollama.com/ ) ๋ฅผ ์ค์นํ๊ณ ** ollama pull** ๋ช
๋ น์ ์ฌ์ฉํ์ฌ ๋ชจ๋ธ์ ๋ค์ด๋ก๋ํ์ธ์.
5441
5542``` python
5643from scrapegraphai.graphs import SmartScraperGraph
5744
45+ # ์คํฌ๋ํ ํ์ดํ๋ผ์ธ์ ๋ํ ๊ตฌ์ฑ ์ ์
5846graph_config = {
5947 " llm" : {
60- " model" : " ollama/mistral" ,
61- " temperature" : 0 ,
62- " format" : " json" , # Ollama needs the format to be specified explicitly
63- " base_url" : " http://localhost:11434" , # set Ollama URL
64- },
65- " embeddings" : {
66- " model" : " ollama/nomic-embed-text" ,
67- " base_url" : " http://localhost:11434" , # set Ollama URL
48+ " model" : " ollama/llama3.2" ,
49+ " model_tokens" : 8192
6850 },
6951 " verbose" : True ,
52+ " headless" : False ,
7053}
7154
55+ # SmartScraperGraph ์ธ์คํด์ค ์์ฑ
7256smart_scraper_graph = SmartScraperGraph(
73- prompt = " List me all the projects with their descriptions" ,
74- # also accepts a string with the already downloaded HTML code
75- source = " https://perinim.github.io/projects" ,
57+ prompt = " Extract useful information from the webpage, including a description of what the company does, founders and social media links" ,
58+ source = " https://scrapegraphai.com/" ,
7659 config = graph_config
7760)
7861
62+ # ํ์ดํ๋ผ์ธ ์คํ
7963result = smart_scraper_graph.run()
80- print (result)
81- ```
8264
83- ์ถ๋ ฅ์ ๋ค์๊ณผ ๊ฐ์ด ํ๋ก์ ํธ์ ์ค๋ช
์ ๋ชฉ๋ก์ด ๋ ๊ฒ์
๋๋ค:
84-
85- ``` python
86- {' projects' : [{' title' : ' Rotary Pendulum RL' , ' description' : ' Open Source project aimed at controlling a real life rotary pendulum using RL algorithms' }, {' title' : ' DQN Implementation from scratch' , ' description' : ' Developed a Deep Q-Network algorithm to train a simple and double pendulum' }, ... ]}
65+ import json
66+ print (json.dumps(result, indent = 4 ))
8767```
8868
89- ### ์ฌ๋ก 2: ํผํฉ ๋ชจ๋ธ์ ์ฌ์ฉํ๋ SearchGraph
90- ์ฐ๋ฆฌ๋ LLM์ ** Groq** ๋ฅผ ์ฌ์ฉํ๊ณ , ์๋ฒ ๋ฉ์ ** Ollama** ๋ฅผ ์ฌ์ฉํฉ๋๋ค.
69+ > [ !NOTE]
70+ > OpenAI๋ ๋ค๋ฅธ ๋ชจ๋ธ๋ค์ LLM ์ค์ ๋ง ๋ฐ๊พธ๋ฉด ๋ฉ๋๋ค!
71+ > ``` python
72+ > graph_config = {
73+ > " llm" : {
74+ > " api_key" : " YOUR_OPENAI_API_KEY" ,
75+ > " model" : " openai/gpt-4o-mini" ,
76+ > },
77+ > " verbose" : True ,
78+ > " headless" : False ,
79+ > }
80+ > ```
9181
92- ``` python
93- from scrapegraphai.graphs import SearchGraph
9482
95- # Define the configuration for the graph
96- graph_config = {
97- " llm" : {
98- " model" : " groq/gemma-7b-it" ,
99- " api_key" : " GROQ_API_KEY" ,
100- " temperature" : 0
101- },
102- " embeddings" : {
103- " model" : " ollama/nomic-embed-text" ,
104- " base_url" : " http://localhost:11434" , # set ollama URL arbitrarily
105- },
106- " max_results" : 5 ,
107- }
108-
109- # Create the SearchGraph instance
110- search_graph = SearchGraph(
111- prompt = " List me all the traditional recipes from Chioggia" ,
112- config = graph_config
113- )
114-
115- # Run the graph
116- result = search_graph.run()
117- print (result)
118- ```
119-
120- ์ถ๋ ฅ์ ๋ค์๊ณผ ๊ฐ์ด ๋ ์ํผ ๋ชฉ๋ก์ด ๋ ๊ฒ์
๋๋ค:
83+ ์ถ๋ ฅ์ ๋ค์๊ณผ ๊ฐ์ dictionary ํํ๊ฐ ๋ ๊ฒ์
๋๋ค
12184
12285```python
123- {' recipes' : [{' name' : ' Sarde in Saรฒre' }, {' name' : ' Bigoli in salsa' }, {' name' : ' Seppie in umido' }, {' name' : ' Moleche frite' }, {' name' : ' Risotto alla pescatora' }, {' name' : ' Broeto' }, {' name' : ' Bibarasse in Cassopipa' }, {' name' : ' Risi e bisi' }, {' name' : ' Smegiassa Ciosota' }]}
86+ {
87+ " description" : " ScrapeGraphAI transforms websites into clean, organized data for AI agents and data analytics. It offers an AI-powered API for effortless and cost-effective data extraction." ,
88+ " founders" : [
89+ {
90+ " name" : " " ,
91+ " role" : " Founder & Technical Lead" ,
92+ " linkedin" : " https://www.linkedin.com/in/perinim/"
93+ },
94+ {
95+ " name" : " Marco Vinciguerra" ,
96+ " role" : " Founder & Software Engineer" ,
97+ " linkedin" : " https://www.linkedin.com/in/marco-vinciguerra-7ba365242/"
98+ },
99+ {
100+ " name" : " Lorenzo Padoan" ,
101+ " role" : " Founder & Product Engineer" ,
102+ " linkedin" : " https://www.linkedin.com/in/lorenzo-padoan-4521a2154/"
103+ }
104+ ],
105+ " social_media_links" : {
106+ " linkedin" : " https://www.linkedin.com/company/101881123" ,
107+ " twitter" : " https://x.com/scrapegraphai" ,
108+ " github" : " https://github.com/ScrapeGraphAI/Scrapegraph-ai"
109+ }
110+ }
124111```
125- ### ์ฌ๋ก 3: OpenAI๋ฅผ ์ฌ์ฉํ๋ SpeechGraph
112+ ์ฌ๋ฌ ํ์ด์ง์์ ์ ๋ณด๋ฅผ ์ถ์ถํ๊ฑฐ๋, Python ์คํฌ๋ฆฝํธ๋ฅผ ์์ฑํ๊ฑฐ๋, ์ฌ์ง์ด ์ค๋์ค ํ์ผ์ ์์ฑํ๋ ๋ฐ ์ฌ์ฉํ ์ ์๋ ๋ค๋ฅธ ํ์ดํ๋ผ์ธ๋ ์์ต๋๋ค.
126113
127- OpenAI API ํค์ ๋ชจ๋ธ ์ด๋ฆ๋ง ์ ๋ฌํ๋ฉด ๋ฉ๋๋ค.
114+ | ํ์ดํ๋ผ์ธ | ์ค๋ช
|
115+ | -------------------------| ------------------------------------------------------------------------------------------------------------------|
116+ | SmartScraperGraph | ์ฌ์ฉ์ ํ๋กฌํํธ์ ์
๋ ฅ ์์ค๋ง ์์ผ๋ฉด ๋๋ ๋จ์ผ ํ์ด์ง ์คํฌ๋ํผ์
๋๋ค.
117+ |
118+ | SearchGraph | ๊ฒ์ ์์ง์ ์์ n๊ฐ ๊ฒ์ ๊ฒฐ๊ณผ์์ ์ ๋ณด๋ฅผ ์ถ์ถํ๋ ๋ค์ค ํ์ด์ง ์คํฌ๋ํผ์
๋๋ค. |
119+ | SpeechGraph | ์น์ฌ์ดํธ์์ ์ ๋ณด๋ฅผ ์ถ์ถํ๊ณ ์ค๋์ค ํ์ผ์ ์์ฑํ๋ ๋จ์ผ ํ์ด์ง ์คํฌ๋ํผ์
๋๋ค. |
120+ | ScriptCreatorGraph | ์น์ฌ์ดํธ์์ ์ ๋ณด๋ฅผ ์ถ์ถํ๊ณ ํ์ด์ฌ ์คํฌ๋ฆฝํธ๋ฅผ ์์ฑํ๋ ๋จ์ผ ํ์ด์ง ์คํฌ๋ํผ์
๋๋ค.
121+ |
122+ | SmartScraperMultiGraph | ๋จ์ผ ํ๋กฌํํธ์ ์ถ์ฒ ๋ชฉ๋ก์ด ์ฃผ์ด์ง๋ฉด ์ฌ๋ฌ ํ์ด์ง์์ ์ ๋ณด๋ฅผ ์ถ์ถํ๋ ๋ค์ค ํ์ด์ง ์คํฌ๋ํผ์
๋๋ค. |
123+ | ScriptCreatorMultiGraph | ์ฌ๋ฌ ํ์ด์ง์ ์์ค์์ ์ ๋ณด๋ฅผ ์ถ์ถํ๊ธฐ ์ํ ํ์ด์ฌ ์คํฌ๋ฆฝํธ๋ฅผ ์์ฑํ๋ ๋ค์ค ํ์ด์ง ์คํฌ๋ํผ์
๋๋ค. |
128124
129- ``` python
130- from scrapegraphai.graphs import SpeechGraph
125+ ๊ฐ ๊ทธ๋ํ์๋ ๋ค์ค ๋ฒ์ ์ด ์์ต๋๋ค. ์ด๋ฅผ ํตํด LLM์ ๋ณ๋ ฌ๋ก ํธ์ถํ ์ ์์ต๋๋ค.
131126
132- graph_config = {
133- " llm" : {
134- " api_key" : " OPENAI_API_KEY" ,
135- " model" : " openai/gpt-3.5-turbo" ,
136- },
137- " tts_model" : {
138- " api_key" : " OPENAI_API_KEY" ,
139- " model" : " tts-1" ,
140- " voice" : " alloy"
141- },
142- " output_path" : " audio_summary.mp3" ,
143- }
127+ OpenAI, Groq, Azure, Gemini์ ๊ฐ์ API๋ฅผ ํตํด ๋ค์ํ LLM์ ์ฌ์ฉํ ์ ์์ผ๋ฉฐ, Ollama๋ฅผ ์ด์ฉํ ๋ก์ปฌ ๋ชจ๋ธ๋ ๊ฐ๋ฅํฉ๋๋ค.
144128
145- # ************************************************
146- # Create the SpeechGraph instance and run it
147- # ************************************************
129+ ๋ก์ปฌ ๋ชจ๋ธ์ ์ฌ์ฉํ๋ ค๋ฉด [ Ollama] ( https://ollama.com/ ) ๋ฅผ ์ค์นํ๊ณ ollama pull ๋ช
๋ น์ ์ฌ์ฉํ์ฌ ๋ชจ๋ธ์ ๋ค์ด๋ก๋ํด์ผ ํฉ๋๋ค.
148130
149- speech_graph = SpeechGraph(
150- prompt = " Make a detailed audio summary of the projects." ,
151- source = " https://perinim.github.io/projects/" ,
152- config = graph_config,
153- )
154131
155- result = speech_graph.run()
156- print (result)
132+ ## ๐ Documentation
157133
158- ```
134+ [ ![ Colab์ผ๋ก ์ด๊ธฐ ] ( https://colab.research.google.com/assets/colab-badge.svg )] ( https://colab.research.google.com/drive/1sEZBonBMGP44CtO6GQTwAlL0BGJXjtfd?usp=sharing )
159135
160- ์ถ๋ ฅ์ ํ์ด์ง์ ํ๋ก์ ํธ ์์ฝ์ด ํฌํจ๋ ์ค๋์ค ํ์ผ์ด ๋ ๊ฒ์
๋๋ค.
136+ ScrapeGraphAI ๊ด๋ จ ๋ฌธ์๋ [ ์ฌ๊ธฐ] ( https://scrapegraph-ai.readthedocs.io/en/latest/ ) ์์ ํ์ธํ์ค ์ ์์ต๋๋ค.
137+ Docusaurus๋ [ ์ฌ๊ธฐ] ( https://docs-oss.scrapegraphai.com/ ) ์์ ํ์ธํด ๋ณด์ธ์.
161138
162- ## ์คํฐ
139+ ## ๐ค Contributing
163140
164- <div style =" text-align : center ;" >
165- <a href =" https://serpapi.com?utm_source=scrapegraphai " >
166- <img src="https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/serp_api_logo.png" alt="SerpAPI" style="width: 10%;">
167- </a >
168- <a href =" https://dashboard.statproxies.com/?refferal=scrapegraph " >
169- <img src="https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/transparent_stat.png" alt="Stats" style="width: 15%;">
170- </a >
171- </div >
141+ ์์ ๋กญ๊ฒ ๊ธฐ์ฌํ๊ณ Discord ์๋ฒ์ ์ฐธ์ฌํ์ฌ ๊ฐ์ ์ฌํญ์ ๋
ผ์ํ๊ณ ์ ์ํด ์ฃผ์ธ์!
172142
173- ## ๐ค ๊ธฐ์ฌ
143+ [ ๊ธฐ์ฌ ๊ฐ์ด๋๋ผ์ธ ] ( https://github.com/VinciGit00/Scrapegraph-ai/blob/main/CONTRIBUTING.md ) ์ ์ฐธ๊ณ ํ์ธ์.
174144
175- ๊ธฐ์ฌ๋ฅผ ํ์ํ๋ฉฐ, ๊ฐ์ ์ฌํญ์ ๋
ผ์ํ๊ณ ์ ์ ์ฌํญ์ ์ฃผ๊ณ ๋ฐ๊ธฐ ์ํด ์ฐ๋ฆฌ์ Discord ์๋ฒ์ ์ฐธ์ฌํ์ธ์!
145+ [ ![ My Skills] ( https://skillicons.dev/icons?i=discord )] ( https://discord.gg/uJN7TYcpNa )
146+ [ ![ My Skills] ( https://skillicons.dev/icons?i=linkedin )] ( https://www.linkedin.com/company/scrapegraphai/ )
147+ [ ![ My Skills] ( https://skillicons.dev/icons?i=twitter )] ( https://twitter.com/scrapegraphai )
176148
177- ๊ธฐ์ฌ ๊ฐ์ด๋๋ผ์ธ์ ์ฐธ๊ณ ํด์ฃผ์ธ์: [ contributing guidelines] ( https://github.com/VinciGit00/Scrapegraph-ai/blob/main/CONTRIBUTING.md ) .
149+ ## ๐ ScrapeGraph API & SDKs
150+ ์์คํ
์ ScrapeGraph๋ฅผ ํตํฉํ๊ธฐ ์ํ ๋น ๋ฅธ ์๋ฃจ์
์ ์ฐพ๊ณ ์๋ค๋ฉด, [ ์ฌ๊ธฐ!] ( https://dashboard.scrapegraphai.com/login ) ์์ ๊ฐ๋ ฅํ API๋ฅผ ํ์ธํด ๋ณด์ธ์.
178151
179- ## ๐ ๋ก๋๋งต
152+ <p align =" center " >
153+ <img src =" https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/api-banner.png " alt =" ScrapeGraph API Banner " style =" width : 100% ;" >
154+ </p >
180155
181- ๋ค์ ๊ธฐ๋ฅ๋ค์ ์์
ํ๊ณ ์์ต๋๋ค! ํ์
์ ๊ด์ฌ์ด ์์ผ์๋ฉด ํด๋น ๊ธฐ๋ฅ์ ๋ง์ฐ์ค ์ค๋ฅธ์ชฝ ๋ฒํผ์ผ๋ก ํด๋ฆญํ์ฌ ์ ํญ์์ PR์ ์์ฑํด์ฃผ์ธ์. ์๋ฌธ์ฌํญ์ด ์๊ฑฐ๋ ๋
ผ์ํ๊ณ ์ถ๋ค๋ฉด [ Discord ] ( https://discord.gg/uJN7TYcpNa ) ์์ ์ ํฌ์๊ฒ ์ฐ๋ฝํ๊ฑฐ๋ Github์ [ Discussion ] ( https://github.com/VinciGit00/Scrapegraph-ai/discussions ) ํ์ด์ง๋ฅผ ์ด์ด์ฃผ์ธ์!
156+ Python๊ณผ Node.js SDK๋ฅผ ์ ๊ณตํ์ฌ ํ๋ก์ ํธ์ ์ฝ๊ฒ ํตํฉํ ์ ์์ต๋๋ค. ์๋์์ ํ์ธํด ๋ณด์ธ์.
182157
183- ``` mermaid
184- %%{init: {'theme': 'base', 'themeVariables': { 'primaryColor': '#5C4B9B', 'edgeLabelBackground':'#ffffff', 'tertiaryColor': '#ffffff', 'primaryBorderColor': '#5C4B9B', 'fontFamily': 'Arial', 'fontSize': '16px', 'textColor': '#5C4B9B' }}}%%
185- graph LR
186- A[DeepSearch Graph] --> F[Use Existing Chromium Instances]
187- F --> B[Page Caching]
188- B --> C[Screenshot Scraping]
189- C --> D[Handle Dynamic Content]
190- D --> E[New Webdrivers]
158+ | SDK | Language | GitHub Link |
159+ | -----------| ----------| -----------------------------------------------------------------------------|
160+ | Python SDK | Python | [ scrapegraph-py] ( https://github.com/ScrapeGraphAI/scrapegraph-sdk/tree/main/scrapegraph-py ) |
161+ | Node.js SDK | Node.js | [ scrapegraph-js] ( https://github.com/ScrapeGraphAI/scrapegraph-sdk/tree/main/scrapegraph-js ) |
191162
192- style A fill:#ffffff,stroke:#5C4B9B,stroke-width:2px,rx:10,ry:10
193- style F fill:#ffffff,stroke:#5C4B9B,stroke-width:2px,rx:10,ry:10
194- style B fill:#ffffff,stroke:#5C4B9B,stroke-width:2px,rx:10,ry:10
195- style C fill:#ffffff,stroke:#5C4B9B,stroke-width:2px,rx:10,ry:10
196- style D fill:#ffffff,stroke:#5C4B9B,stroke-width:2px,rx:10,ry:10
197- style E fill:#ffffff,stroke:#5C4B9B,stroke-width:2px,rx:10,ry:10
163+ ๊ณต์ API ๋ฌธ์๋ [ ์ฌ๊ธฐ] ( https://docs.scrapegraphai.com/ ) ์์ ํ์ธํ ์ ์์ต๋๋ค.
198164
199- click A href "https://github.com/VinciGit00/Scrapegraph-ai/issues/260" "Open DeepSearch Graph Issue"
200- click F href "https://github.com/VinciGit00/Scrapegraph-ai/issues/329" "Open Chromium Instances Issue"
201- click B href "https://github.com/VinciGit00/Scrapegraph-ai/issues/197" "Open Page Caching Issue"
202- click C href "https://github.com/VinciGit00/Scrapegraph-ai/issues/197" "Open Screenshot Scraping Issue"
203- click D href "https://github.com/VinciGit00/Scrapegraph-ai/issues/279" "Open Handle Dynamic Content Issue"
204- click E href "https://github.com/VinciGit00/Scrapegraph-ai/issues/171" "Open New Webdrivers Issue"
205- ```
165+ ## ๐ Telemetry
166+ ์ ํฌ๋ ํจํค์ง์ ํ์ง๊ณผ ์ฌ์ฉ์ ๊ฒฝํ์ ํฅ์์ํค๊ธฐ ์ํด ์ต๋ช
์ ์ฌ์ฉ ์งํ๋ฅผ ์์งํฉ๋๋ค. ์ด ๋ฐ์ดํฐ๋ ๊ฐ์ ์ฌํญ์ ์ฐ์ ์์๋ฅผ ์ ํ๊ณ ํธํ์ฑ์ ๋ณด์ฅํ๋ ๋ฐ ๋์์ด ๋ฉ๋๋ค. ์ตํธ์์ํ๋ ค๋ฉด ํ๊ฒฝ ๋ณ์ SCRAPEGRAPHAI_TELEMETRY_ENABLED=false๋ฅผ ์ค์ ํ์ธ์. ์์ธํ ๋ด์ฉ์ [ ์ฌ๊ธฐ] ( https://scrapegraph-ai.readthedocs.io/en/latest/scrapers/telemetry.html ) ์์ ์ค๋ช
์๋ฅผ ์ฐธ์กฐํ์ธ์.
206167
207168## ๏ธ ๊ธฐ์ฌ์๋ค
208169[ ![ Contributors] ( https://contrib.rocks/image?repo=VinciGit00/Scrapegraph-ai )] ( https://github.com/VinciGit00/Scrapegraph-ai/graphs/contributors )
209170
210171## ๐ ์ธ์ฉ
211- ์ฐ๋ฆฌ์ ๋ผ์ด๋ธ๋ฌ๋ฆฌ๋ฅผ ์ฐ๊ตฌ ๋ชฉ์ ์ผ๋ก ์ฌ์ฉํ ๊ฒฝ์ฐ ๋ค์๊ณผ ๊ฐ์ด ์ธ์ฉํด ์ฃผ์ธ์:
172+ ์ฐ๋ฆฌ์ ๋ผ์ด๋ธ๋ฌ๋ฆฌ๋ฅผ ์ฐ๊ตฌ ๋ชฉ์ ์ผ๋ก ์ฌ์ฉํ ๊ฒฝ์ฐ ๋ค์๊ณผ ๊ฐ์ด ์ธ์ฉํด ์ฃผ์ธ์
212173``` text
213174 @misc{scrapegraph-ai,
214175 author = {, Lorenzo Padoan, Marco Vinciguerra},
@@ -221,19 +182,14 @@ graph LR
221182
222183## ์ ์๋ค
223184
224- <p align =" center " >
225- <img src =" https://raw.githubusercontent.com/VinciGit00/Scrapegraph-ai/main/docs/assets/logo_authors.png " alt =" Authors_logos " >
226- </p >
227-
228185| | ์ฐ๋ฝ์ฒ |
229186| --------------------| ---------------|
230187| Marco Vinciguerra | [ ![ Linkedin Badge] ( https://img.shields.io/badge/-Linkedin-blue?style=flat&logo=Linkedin&logoColor=white )] ( https://www.linkedin.com/in/marco-vinciguerra-7ba365242/ ) |
231- | | [ ![ Linkedin Badge] ( https://img.shields.io/badge/-Linkedin-blue?style=flat&logo=Linkedin&logoColor=white )] ( https://www.linkedin.com/in/perinim/ ) |
232188| Lorenzo Padoan | [ ![ Linkedin Badge] ( https://img.shields.io/badge/-Linkedin-blue?style=flat&logo=Linkedin&logoColor=white )] ( https://www.linkedin.com/in/lorenzo-padoan-4521a2154/ ) |
233189
234190## ๐ ๋ผ์ด์ ์ค
235191
236- ScrapeGraphAI๋ MIT License๋ก ๋ฐฐํฌ๋์์ต๋ . ์์ธํ ๋ด์ฉ์ [ LICENSE] ( https://github.com/VinciGit00/Scrapegraph-ai/blob/main/LICENSE ) ํ์ผ์ ์ฐธ์กฐํ์ธ์.
192+ ScrapeGraphAI๋ MIT License๋ก ๋ฐฐํฌ๋์์ต๋๋ค . ์์ธํ ๋ด์ฉ์ [ LICENSE] ( https://github.com/VinciGit00/Scrapegraph-ai/blob/main/LICENSE ) ํ์ผ์ ์ฐธ์กฐํ์ธ์.
237193
238194## ๊ฐ์ฌ์ ๋ง
239195
0 commit comments