파싱정규식 봐주세요 ㅠㅠ

파싱정규식 봐주세요 ㅠㅠ

QA

파싱정규식 봐주세요 ㅠㅠ

본문

바트파싱기로 여기에 리스트 뽑을려고 하는데 제실력으로 하다가 고수님들한테 질문 드립니다

제가 기존에 만든 바트정규식 코드는

1.

~<ul\sclass="i-item">\s+.+?<a\s+href="(./?.act=[^"]+)">([^<]+)\s+.+?<\/li>~isx

2.

~<ul\sclass="i-item">\s+.+?<a\s+href="([^"]+)">([^<]+)\s+.+?<\/li>~isx

둘다 안되네요


전체코드 복사하여 올려드립니다

<!DOCTYPE html>
<html>
<head>
<meta charset="UTF-8">
<title>연변구인정보 - 연변 이얼싼</title>
<meta name="keywords" content="123123,延边招聘,延边人才,延边房产,延边二手,延边二手车,延边黄页" />
<meta name="Description" content="123分类信息网,免费、高效的生活服务平台! 找租房,找二手房,找工作,找兼职,买卖二手,二手车交易,买卖宠物,本地上门服务等海量的生活分类信息,是您最好的免费发布信息网站。" />
<meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1">
<meta name="viewport" content="width=device-width,initial-scale=1,minimum-scale=1,maximum-scale=1,user-scalable=no">
<link rel="apple-touch-icon" href="../mobile_icon.png">
<link rel="apple-touch-icon-precomposed" href="../mobile_icon.png">
<link rel="shortcut icon" href="../favicon.ico?1" />
<link rel="stylesheet" href="pop/styles/pop.css?ver=2017072409">
<link rel="stylesheet" href="./styles/pc/style.css?ver=2017072409">
<script src="./styles/pc/js/jquery.min.js"></script>
<script src="pop/styles/js/pop.js?ver=2017072409"></script>
<script src="./styles/pc/js/jquery.nicescroll.js"></script>
<script src="./styles/pc/js/common.js?ver=2017072409"></script>
<script src="./lang/kr/message.js?ver=2017072409"></script>
<script src="./styles/common.js?ver=2017072409"></script>
<!--[if lt IE 9]>
<script src="styles/pc/js/html5.js"></script>
<![endif]-->
</head>
<body>
<div class="page-body">
<div class="page-body-in">
<div class="page-wrap">
<!-- header start -->
<header class="header">
<div class="layout">
<div class="logo"><a href="./"><img src="styles/pc/img/logo.png" alt="" border="0"></a></div>
<nav class="top-city-nav">
<div class="city-toggle"><span class="city-cur">연변</span></div>
<div class="city-menu">
<i class="ico-arrow"></i>
<ul>
<li><a href="change_lang.php?lang=beijing">북경</a></li>
<li><a href="change_lang.php?lang=shanghai">상해</a></li>
<li><a href="change_lang.php?lang=tianjin">천진</a></li>
<li><a href="change_lang.php?lang=yanbian">연변</a></li>
<li><a href="change_lang.php?lang=guangdong">광동</a></li>
<li><a href="change_lang.php?lang=shandong">산동</a></li>
<li><a href="change_lang.php?lang=liaoning">료녕</a></li>
<li><a href="change_lang.php?lang=zhejiang">절강</a></li>
<li><a href="change_lang.php?lang=jiangsu">강소</a></li>
</ul>
</div>
</nav>
<div class="top-nav-toggle" onClick="leftMenuShow()">메뉴</div>
</div>
</header>
<!--
说明:默认展开菜单
===============================
<li class="active">
<ul style="display:block;">
...
</ul>
</li>
style="display:none; left:-300px"
-->
<nav class="top-nav-bd" >
<div class="nav-inner">
<ul>
<li class="member-menu">
<h4><a class="m-reg" href="./member/?act=login">로그인</a><a class="m-login" href="./member/?act=join">회원가입</a></h4> </li>
<li><span class="menu"><a href="./">홈으로</a></span></li>
<li><a href="./?act=list&bo_table=biz_caipu">123菜谱</a></li>
<li class="active">
<h4>지역정보 </h4>
<ul style="display:block;">
<li><span class="menu"><a href="./?act=list&bo_table=biz_zhaopin">구인정보</a></span></li>
<li><a href="./?act=list&bo_table=biz_qiuzhi">구직정보</a></li>
<li><a href="./?act=list&bo_table=biz_jianzhi">알바정보</a></li>
<li><a href="./?act=list&bo_table=biz_fang">부동산 정보</a></li>
<li><a href="./?act=list&bo_table=biz_maimai">벼룩시장</a></li>
<li><a href="./?act=list&bo_table=biz_che">중고차</a></li>
<li><a href="./?act=list&bo_table=biz_shangjia">업체광고</a></li>
<li><a href="./?act=list&bo_table=biz_jiaoyou">친구찾기</a></li>
<li><a href="./?act=list&bo_table=biz_mall">쇼핑몰 홍보</a></li>
<li><a href="./?act=list&bo_table=biz_jiaoyu">교육정보</a></li>
<li><a href="./?act=list&bo_table=biz_haoma">전화번호</a></li>
<li><a href="./?act=list&bo_table=biz_youxi">게임관련</a></li>
<li><a href="./?act=list&bo_table=biz_chongwu">애완동물</a></li>
<li><a href="./?act=list&bo_table=zhidao">도움요청</a></li>
</ul>
</li>
<li>
<h4>출국유학 </h4>
<ul style="display:block;">
<li><a href="./?act=list&bo_table=biz_chuguonews">새소식/공지</a></li>
<li><a href="./?act=list&bo_table=biz_chuguoqa">출국지식</a></li>
<li><a href="./?act=list&bo_table=biz_chuguo">출국정보</a></li>
</ul>
</li>
<li>
<h4>고객센터 </h4>
<ul>
<li><a href="./about/bbs_guide.php">게시판 이용약관</a></li>
<li><a href="./about/ad.php">광고문의</a></li>
<li><a href="./about/intro.php">사이트소개</a></li>
</ul>
</li>
<li><a class="lang" href="./change_lang.php?lang=cn">中文版</a></li> </ul>
</div>
</nav>
<!--// header end -->
<div class="layout">
<div class="list-tit-bar">
<h2 class="tit"><a href="./?act=list&bo_table=biz_zhaopin">구인정보</a></h2>
<div class="u-acts">
<a class="btn" href="javascript:showMsg('로그인 후 사용 가능합니다.')">내가쓴글</a>
<a class="btn btn-red" href="javascript:showMsg('로그인 후 글쓰기 가능합니다.')">글쓰기</a>
</div>
</div> <!-- tabs -->
<div class="list-tas-bar">
<ul>
<li class="current"><a href="./?act=list&bo_table=biz_zhaopin">전체</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=1">외식/오락</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=2">판매/매장관리</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=3">사무직</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=4">상담/영업</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=5">IT/디자인</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=6">서비스업</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=7">생산/건설</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=8">강사/교육</a></li>
</ul>
</div>
<!-- list main start -->
<div class="list-main">
<!-- list -->
<section class="list-wrap img-txt-list">
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=996074&page=2">龙井招聘口腔护士兼前台 </a></li>
<li class="row-4"></li>
<li class="row-5">7</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=998884&page=2">厨房阿姨及配菜 </a></li>
<li class="row-4"></li>
<li class="row-5">4</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=999572&page=2">호프요리 할줄 아시는분 모집합니다. </a></li>
<li class="row-4"></li>
<li class="row-5">9</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=999647&page=2">옷가게에서 직원급히구합니다 </a></li>
<li class="row-4"></li>
<li class="row-5">78</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=998149&page=2">쇼핑몰 창업 직원 모집함니다 (남 녀) </a></li>
<li class="row-4"></li>
<li class="row-5">19</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=999756&page=2">룡정고기집 여직원급구 </a></li>
<li class="row-4"></li>
<li class="row-5">36</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=996067&page=2">아리랑헤어샵에서 헤어디자이너 스텝을 모집합니다. </a></li>
<li class="row-4"></li>
<li class="row-5">12</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=999517&page=2">컴퓨터 핸드폰간편하게 </a></li>
<li class="row-4"></li>
<li class="row-5">84</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=999565&page=2">김밥집에서 주방보조구합니다 </a></li>
<li class="row-4"></li>
<li class="row-5">69</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=993301&page=2">회관 여복무원을 모집합니다. </a></li>
<li class="row-4"></li>
<li class="row-5">6</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=987479&page=2">북대시장부근 부동산에서 조선족 엄무원 모집합니다. </a></li>
<li class="row-4"></li>
<li class="row-5">3</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=998960&page=2">주방이모구합니다 하루6시간출근 주1회휴무 </a></li>
<li class="row-4"></li>

이 질문에 댓글 쓰기 :

답변 1

1번, 2번 다 잘되는것 같습니다.



<?php
$contents = file_get_contents('contents.html');

// 1.
$regexp = '~<ul\sclass="i-item">\s+.+?<a\s+href="(./?.act=[^"]+)">([^<]+)\s+.+?<\/li>~isx';
// 2.
// $regexp = '~<ul\sclass="i-item">\s+.+?<a\s+href="([^"]+)">([^<]+)\s+.+?<\/li>~isx';
$lists  = [];
preg_match_all($regexp, $contents, $lists);
echo count($lists);

print('<pre><xmp>');
print_r($lists);
print('</xmp></pre>');
?>
답변을 작성하시기 전에 로그인 해주세요.
전체 129,114 | RSS
QA 내용 검색

회원로그인

(주)에스아이알소프트 / 대표:홍석명 / (06211) 서울특별시 강남구 역삼동 707-34 한신인터밸리24 서관 1402호 / E-Mail: admin@sir.kr
사업자등록번호: 217-81-36347 / 통신판매업신고번호:2014-서울강남-02098호 / 개인정보보호책임자:김민섭(minsup@sir.kr)
© SIRSOFT