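Please take a look at my parsing regex
Body
I'm trying to pull a list out of the page below with the Bart parser (바트파싱기). I tried it on my own first, but I'm stuck, so I'm asking the experts here.
The Bart regex patterns I've written so far are: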
1.
~<ul\sclass="i-item">\s+.+?<a\s+href="(./?.act=[^"]+)">([^<]+)\s+.+?<\/li>~isx
2.
~<ul\sclass="i-item">\s+.+?<a\s+href="([^"]+)">([^<]+)\s+.+?<\/li>~isx
Neither of them works.
I've copied the full page source and pasted it below.
<!DOCTYPE html>
<html>
<head>
<meta charset="UTF-8">
<title>연변구인정보 - 연변 이얼싼</title>
<meta name="keywords" content="123123,延边招聘,延边人才,延边房产,延边二手,延边二手车,延边黄页" />
<meta name="Description" content="123分类信息网,免费、高效的生活服务平台! 找租房,找二手房,找工作,找兼职,买卖二手,二手车交易,买卖宠物,本地上门服务等海量的生活分类信息,是您最好的免费发布信息网站。" />
<meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1">
<meta name="viewport" content="width=device-width,initial-scale=1,minimum-scale=1,maximum-scale=1,user-scalable=no">
<link rel="apple-touch-icon" href="../mobile_icon.png">
<link rel="apple-touch-icon-precomposed" href="../mobile_icon.png">
<link rel="shortcut icon" href="../favicon.ico?1" />
<link rel="stylesheet" href="pop/styles/pop.css?ver=2017072409">
<link rel="stylesheet" href="./styles/pc/style.css?ver=2017072409">
<script src="./styles/pc/js/jquery.min.js"></script>
<script src="pop/styles/js/pop.js?ver=2017072409"></script>
<script src="./styles/pc/js/jquery.nicescroll.js"></script>
<script src="./styles/pc/js/common.js?ver=2017072409"></script>
<script src="./lang/kr/message.js?ver=2017072409"></script>
<script src="./styles/common.js?ver=2017072409"></script>
<!--[if lt IE 9]>
<script src="styles/pc/js/html5.js"></script>
<![endif]-->
</head>
<body>
<div class="page-body">
<div class="page-body-in">
<div class="page-wrap">
<!-- header start -->
<header class="header">
<div class="layout">
<div class="logo"><a href="./"><img src="styles/pc/img/logo.png" alt="" border="0"></a></div>
<nav class="top-city-nav">
<div class="city-toggle"><span class="city-cur">연변</span></div>
<div class="city-menu">
<i class="ico-arrow"></i>
<ul>
<li><a href="change_lang.php?lang=beijing">북경</a></li>
<li><a href="change_lang.php?lang=shanghai">상해</a></li>
<li><a href="change_lang.php?lang=tianjin">천진</a></li>
<li><a href="change_lang.php?lang=yanbian">연변</a></li>
<li><a href="change_lang.php?lang=guangdong">광동</a></li>
<li><a href="change_lang.php?lang=shandong">산동</a></li>
<li><a href="change_lang.php?lang=liaoning">료녕</a></li>
<li><a href="change_lang.php?lang=zhejiang">절강</a></li>
<li><a href="change_lang.php?lang=jiangsu">강소</a></li>
</ul>
</div>
</nav>
<div class="top-nav-toggle" onClick="leftMenuShow()">메뉴</div>
</div>
</header>
<!--
说明:默认展开菜单
===============================
<li class="active">
<ul style="display:block;">
...
</ul>
</li>
style="display:none; left:-300px"
-->
<nav class="top-nav-bd" >
<div class="nav-inner">
<ul>
<li class="member-menu">
<h4><a class="m-reg" href="./member/?act=login">로그인</a><a class="m-login" href="./member/?act=join">회원가입</a></h4> </li>
<li><span class="menu"><a href="./">홈으로</a></span></li>
<li><a href="./?act=list&bo_table=biz_caipu">123菜谱</a></li>
<li class="active">
<h4>지역정보 </h4>
<ul style="display:block;">
<li><span class="menu"><a href="./?act=list&bo_table=biz_zhaopin">구인정보</a></span></li>
<li><a href="./?act=list&bo_table=biz_qiuzhi">구직정보</a></li>
<li><a href="./?act=list&bo_table=biz_jianzhi">알바정보</a></li>
<li><a href="./?act=list&bo_table=biz_fang">부동산 정보</a></li>
<li><a href="./?act=list&bo_table=biz_maimai">벼룩시장</a></li>
<li><a href="./?act=list&bo_table=biz_che">중고차</a></li>
<li><a href="./?act=list&bo_table=biz_shangjia">업체광고</a></li>
<li><a href="./?act=list&bo_table=biz_jiaoyou">친구찾기</a></li>
<li><a href="./?act=list&bo_table=biz_mall">쇼핑몰 홍보</a></li>
<li><a href="./?act=list&bo_table=biz_jiaoyu">교육정보</a></li>
<li><a href="./?act=list&bo_table=biz_haoma">전화번호</a></li>
<li><a href="./?act=list&bo_table=biz_youxi">게임관련</a></li>
<li><a href="./?act=list&bo_table=biz_chongwu">애완동물</a></li>
<li><a href="./?act=list&bo_table=zhidao">도움요청</a></li>
</ul>
</li>
<li>
<h4>출국유학 </h4>
<ul style="display:block;">
<li><a href="./?act=list&bo_table=biz_chuguonews">새소식/공지</a></li>
<li><a href="./?act=list&bo_table=biz_chuguoqa">출국지식</a></li>
<li><a href="./?act=list&bo_table=biz_chuguo">출국정보</a></li>
</ul>
</li>
<li>
<h4>고객센터 </h4>
<ul>
<li><a href="./about/bbs_guide.php">게시판 이용약관</a></li>
<li><a href="./about/ad.php">광고문의</a></li>
<li><a href="./about/intro.php">사이트소개</a></li>
</ul>
</li>
<li><a class="lang" href="./change_lang.php?lang=cn">中文版</a></li> </ul>
</div>
</nav>
<!--// header end -->
<div class="layout">
<div class="list-tit-bar">
<h2 class="tit"><a href="./?act=list&bo_table=biz_zhaopin">구인정보</a></h2>
<div class="u-acts">
<a class="btn" href="javascript:showMsg('로그인 후 사용 가능합니다.')">내가쓴글</a>
<a class="btn btn-red" href="javascript:showMsg('로그인 후 글쓰기 가능합니다.')">글쓰기</a>
</div>
</div> <!-- tabs -->
<div class="list-tas-bar">
<ul>
<li class="current"><a href="./?act=list&bo_table=biz_zhaopin">전체</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=1">외식/오락</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=2">판매/매장관리</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=3">사무직</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=4">상담/영업</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=5">IT/디자인</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=6">서비스업</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=7">생산/건설</a></li>
<li><a href="./?act=list&bo_table=biz_zhaopin&cate1=8">강사/교육</a></li>
</ul>
</div>
<!-- list main start -->
<div class="list-main">
<!-- list -->
<section class="list-wrap img-txt-list">
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=996074&page=2">龙井招聘口腔护士兼前台 </a></li>
<li class="row-4"></li>
<li class="row-5">7</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=998884&page=2">厨房阿姨及配菜 </a></li>
<li class="row-4"></li>
<li class="row-5">4</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=999572&page=2">호프요리 할줄 아시는분 모집합니다. </a></li>
<li class="row-4"></li>
<li class="row-5">9</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=999647&page=2">옷가게에서 직원급히구합니다 </a></li>
<li class="row-4"></li>
<li class="row-5">78</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=998149&page=2">쇼핑몰 창업 직원 모집함니다 (남 녀) </a></li>
<li class="row-4"></li>
<li class="row-5">19</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=999756&page=2">룡정고기집 여직원급구 </a></li>
<li class="row-4"></li>
<li class="row-5">36</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=996067&page=2">아리랑헤어샵에서 헤어디자이너 스텝을 모집합니다. </a></li>
<li class="row-4"></li>
<li class="row-5">12</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=999517&page=2">컴퓨터 핸드폰간편하게 </a></li>
<li class="row-4"></li>
<li class="row-5">84</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=999565&page=2">김밥집에서 주방보조구합니다 </a></li>
<li class="row-4"></li>
<li class="row-5">69</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=993301&page=2">회관 여복무원을 모집합니다. </a></li>
<li class="row-4"></li>
<li class="row-5">6</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=987479&page=2">북대시장부근 부동산에서 조선족 엄무원 모집합니다. </a></li>
<li class="row-4"></li>
<li class="row-5">3</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=998960&page=2">주방이모구합니다 하루6시간출근 주1회휴무 </a></li>
<li class="row-4"></li>
Answer 1
Both pattern 1 and pattern 2 seem to work fine.
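<?php
// Load the saved list page (the HTML pasted in the question).
$contents = file_get_contents('contents.html');
// 1.
$regexp = '~<ul\sclass="i-item">\s+.+?<a\s+href="(./?.act=[^"]+)">([^<]+)\s+.+?<\/li>~isx';
// 2.
// $regexp = '~<ul\sclass="i-item">\s+.+?<a\s+href="([^"]+)">([^<]+)\s+.+?<\/li>~isx';
$lists = [];
preg_match_all($regexp, $contents, $lists);
// $lists[0] holds the full matches, $lists[1] the hrefs, $lists[2] the titles,
// so count($lists) is always 3; count($lists[0]) is the number of rows matched.
echo count($lists[0]);
// Escape the dump so the matched HTML is shown as text instead of being rendered.
print('<pre>');
print(htmlspecialchars(print_r($lists, true)));
print('</pre>');
?>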
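If the patterns still fail inside the parser itself, a narrower pattern may be easier to debug. The following is only a minimal sketch, not the Bart parser's own method: it anchors on the `row-3` cell rather than spanning from `<ul>` to `</li>`, and it does not require a trailing space after the title. The `$html` sample (two rows copied from the question), the `$pattern` and the `$items` array are names assumed here for illustration.

```php
<?php
// Hypothetical sample: two rows copied from the list markup in the question.
$html = <<<'HTML'
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=996074&page=2">龙井招聘口腔护士兼前台 </a></li>
<li class="row-4"></li>
<li class="row-5">7</li>
</ul>
<ul class="i-item">
<li class="row-1">17.09.16</li>
<li class="row-3"><a href="./?act=view&bo_table=biz_zhaopin&id=998884&page=2">厨房阿姨及配菜 </a></li>
<li class="row-4"></li>
<li class="row-5">4</li>
</ul>
HTML;

// Group 1 is the href, group 2 the title with surrounding whitespace trimmed.
// The "u" modifier makes PCRE treat the pattern and subject as UTF-8.
$pattern = '~<li\s+class="row-3">\s*<a\s+href="([^"]+)">\s*([^<]*?)\s*</a>~iu';

// PREG_SET_ORDER returns one array per matched row, which is easier to loop over.
preg_match_all($pattern, $html, $matches, PREG_SET_ORDER);

$items = [];
foreach ($matches as $m) {
    $items[] = ['href' => $m[1], 'title' => $m[2]];
}

print_r($items);
```

With `PREG_SET_ORDER`, `count($matches)` is the number of rows found, so it can be checked directly against the number of `<ul class="i-item">` blocks on the page.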